Proclus Academy
Posts
Machine Learning / Deep Learning
2023
Sep 8
How to Slice Pandas DataFrames Using iloc and loc
Pandas
iloc
loc
May 18
Pandas: Why and How to Use idxmin() and idxmax()
Pandas
idxmin()
idxmax()
QuickTip
Apr 24
Pandas: How to Read Data From HTML Tables
Pandas
HTML
Apr 16
Pandas: How to Read and Write Data to a SQL Database
Pandas
Database
MySQL
SQLAlchemy
Parameterized Query
Mar 26
How to Read and Write Excel Files Using Pandas
Pandas
Excel
Dataset: USA Cities
Mar 14
Pandas: A Quick and Practical Guide for Beginners
Pandas
Visualization
Dataset: Retail Sales
Feb 27
Measures of Spread: MAD, Variance, and Standard Deviation
Statistics
Measures of Spread
Variance
Standard Deviation
Mean Absolute Deviation (MAD)
Feb 3
Measures of Central Tendency: Mean, Median, Weighted Mean, and Mode
Statistics
Measures of Central Tendency
Mean
Median
Weighted Mean
Mode
Outliers
Jan 1
Master Machine Learning and Deep Learning in 2023: A Self-Study Guide
Self-Study Guide
2022
Dec 10
Normal Distribution: A Practical Guide Using Python and SciPy
Normal Distribution
SciPy
Statistics
Data Distribution
Visualization
Matplotlib
Histogram
Density Curve
Percentile
Nov 25
Draw Dot Plot Using Python and Matplotlib
Visualization
Dot Plot
Matplotlib
Data Distribution
Seaborn
Nov 8
Precision, Recall, and F1 Score: A Practical Guide Using Scikit-Learn
Classification Metrics
Accuracy
Accuracy Paradox
Precision
Recall
F1 Score
LogisticRegression
Dataset: ISLR Default
Classification
Nov 8
Precision, Recall, and F1 Score: When Accuracy Betrays You
Classification Metrics
Accuracy
Accuracy Paradox
Precision
Recall
F1 Score
Classification
Oct 14
3 Regression Metrics You Must Know: MAE, MSE, and RMSE
Regression Metrics
MAE
MSE
RMSE
Dataset: Heights and Weights
Regression
Linear Regression
Oct 2
Normal Distribution and the Empirical Rule
Normal Distribution
Empirical Rule
Statistics
Data Distribution
Sep 7
Area Under Density Curve: How to Visualize and Calculate Using Python
Statistics
Data Distribution
Density Curve
Area Under Density Curve
Percentile
Visualization
Matplotlib
Seaborn
Aug 21
Data Distribution, Histogram, and Density Curve: A Practical Guide
Statistics
Data Distribution
Visualization
Histogram
Density Curve
Matplotlib
Seaborn
Frequency Table
Line Plot
Jul 24
How to Customize Pie Charts using Matplotlib
Visualization
Matplotlib
Pie Chart
Seaborn
Donut Chart
Jul 11
What Is Stratified Sampling and How to Do It Using Pandas?
Statistics
Sampling
Simple Random Sampling
Stratified Sampling
Pandas
Dataset: Palmer Penguins
Jul 3
How to Generate Datasets Using make_classification
Classification
make_classification()
RandomForestClassifier
Jun 11
Accuracy and Confusion Matrix Using Scikit-Learn & Seaborn
Classification Metrics
Confusion Matrix
Accuracy
Matplotlib
Seaborn
Visualization
AdaBoostClassifier
Dataset: Pima Diabetes
Classification
Jun 11
Using Confusion Matrix and Accuracy to Test Classification Models
Classification Metrics
Confusion Matrix
Accuracy
Classification
May 29
K-Fold Cross-Validation Using Python and Scikit-Learn
Cross-Validation
cross_val_score()
cross_validate()
Model Selection
RandomForestClassifier
Dataset: Banknote Authentication
May 29
What Is K-Fold Cross-Validation?
Cross-Validation
Train Test Split
Model Selection
Mar 30
Boxplot With Separate Y-Axis for Each Column
Visualization
Boxplot
Matplotlib
Seaborn
Pandas
QuickTip
Dataset: Real Estate Evaluation
Mar 22
Robust Scaling: Why and How to Use It to Handle Outliers
Scaling
Robust Scaler
Outliers
Statistics
Data Preparation
Mar 21
Summary Statistics Using Pandas
Pandas
Statistics
QuickTip
Mar 9
Use Standard and MinMax Scaling to Tame Numerical Features
Scaling
Standard Scaler
MinMax Scaler
Statistics
Data Preparation
Dataset: Real Estate Evaluation
Feb 25
Use ‘Train Test Split’ to Beat Overfitting
Train Test Split
Overfitting
Underfitting
Polynomial Regression
Pipeline
Model Selection
Dataset: Combined Cycle Power Plant
Feb 15
ColumnTransformer: Why and How to Use It to Preprocess Data
ColumnTransformer
Data Preparation
Dataset: Titanic