__STYLES__

Your browser is not supported. Please download another browser to be able to use all of the Maven features.

Self-Paced Course

Data Science in Python: Classification

Master the foundations of classification modeling in Python, including KNN, logistic regression, decision trees, random forests, and gradient boosted machines

Course Hours16 hours
Skills Learned
Data Analysis
Machine Learning
Tools
Python
Course Level
Intermediate
Credentials
Paths

Course Description

This is a hands-on, project-based course designed to help you master the foundations for classification modeling in Python.

We’ll start by reviewing the data science workflow, discussing the primary goals & types of classification algorithms, and do a deep dive into the classification modeling steps we’ll be using throughout the course.

You’ll learn to perform exploratory data analysis, leverage feature engineering techniques like scaling, dummy variables, and binning, and prepare data for modeling by splitting it into train, test, and validation datasets.

From there, we’ll fit K-Nearest Neighbors & Logistic Regression models, and build an intuition for interpreting their coefficients and evaluating their performance using tools like confusion matrices and metrics like accuracy, precision, and recall. We’ll also cover techniques for modeling imbalanced data, including threshold tuning, sampling methods like oversampling & SMOTE, and adjusting class weights in the model cost function.

Throughout the course, you'll play the role of Data Scientist for the risk management department at Maven National Bank. Using the skills you learn throughout the course, you'll use Python to explore their data and build classification models to accurately determine which customers have high, medium, and low credit risk based on their profiles.

Last but not least, you'll learn to build and evaluate decision tree models for classification. You’ll fit, visualize, and fine tune these models using Python, then apply your knowledge to more advanced ensemble models like random forests and gradient boosted machines.

If you're an aspiring data scientist looking for an introduction to the world of classification modeling with Python, this is the course for you.

COURSE CONTENTS:

  • 9.5 hours on-demand video

  • 18 homework assignments

  • 9 quizzes

  • 2 projects

  • 2 skills assessments (1 benchmark, 1 final)

COURSE CURRICULUM:

WHO SHOULD TAKE THIS COURSE?

  • Data analysts or BI experts looking to transition into a data science role

  • Python users who want to build the core skills for applying classification models in Python

  • Anyone interested in learning one of the most popular open source programming languages in the world

WHAT ARE THE COURSE REQUIREMENTS?

  • We strongly recommend taking our Data Prep & EDA and Regression courses first
  • Jupyter Notebooks (free download, we'll walk through the install)
  • Familiarity with base Python and Pandas is recommended, but not required

Start learning for FREE, no credit card required!

Every subscription includes access to the following course materials

  • Interactive Project files
  • Downloadable e-books
  • Graded quizzes and assessments
  • 1-on-1 Expert support
  • 100% satisfaction guarantee
  • Verified credentials & accredited badges
Sign Up Today

Ready to become a

data rockstar?

Start learning for free, no credit card required!

Sign Up for Free