DSMP by CampusX¶

Capstone Project 5 - Uber Demand Prediction¶

Session 1 - Project Overview¶

Description

Notes - https://drive.google.com/file/d/139OjwnkomO4w4GatdectPe2DRgNNnWo1/view?usp=sharing
Dataset - https://www.kaggle.com/datasets/elemento/nyc-yellow-taxi-trip-data?select=yellow_tripdata_2016-03.csv

Session 2 (Part 1) - EDA¶

Description

Notes -https://drive.google.com/file/d/15FAmn612n7AAHNfOxnM2uWIjVz-hLD3Y/view?usp=sharing
Dataset - https://www.kaggle.com/datasets/elemento/nyc-yellow-taxi-trip-data?select=yellow_tripdata_2016-03.csv

Session 2 (Part 2) - Demand Prediction EDA¶

Description

Code - https://github.com/Himanshu-1703/uber-demand-prediction
Dataset - https://www.kaggle.com/datasets/elemento/nyc-yellow-taxi-trip-data?select=yellow_tripdata_2016-03.csv

Session 3 - Breaking New York into regions¶

Description

Notes - https://drive.google.com/file/d/1KvwLO6x5tSx_i1MdZe5R0R4grEUtgCyB/view?usp=sharing
Code - https://github.com/Himanshu-1703/uber-demand-prediction
Dataset - https://www.kaggle.com/datasets/elemento/nyc-yellow-taxi-trip-data?select=yellow_tripdata_2016-03.csv

Session 4 - Creating Historical Data¶

Description

Notes - https://drive.google.com/file/d/1E-az1lqZkW5KssYtrb41iRo8jKxRaawk/view?usp=sharing
Code - https://github.com/Himanshu-1703/uber-demand-prediction
Dataset - https://www.kaggle.com/datasets/elemento/nyc-yellow-taxi-trip-data?select=yellow_tripdata_2016-03.csv

Session 5 - Training a baseline model¶

Description

Notes - https://drive.google.com/file/d/1QJZcWt3h_dPUF2cRJiSb-jMQU6w7oHS7/view?usp=sharing
Code - https://github.com/Himanshu-1703/uber-demand-prediction
Dataset - https://www.kaggle.com/datasets/elemento/nyc-yellow-taxi-trip-data?select=yellow_tripdata_2016-03.csv

Session 6 - Model Selection and HP Tuning¶

Description

Notes - https://drive.google.com/file/d/1mBIjvI0yF9MLgNYJM07KYb0W0f5dki_1/view?usp=sharing
Code - https://github.com/Himanshu-1703/uber-demand-prediction
Dataset - https://www.kaggle.com/datasets/elemento/nyc-yellow-taxi-trip-data?select=yellow_tripdata_2016-03.csv
 

Session 7 - Building the DVC Pipeline¶

Description

Code - https://github.com/Himanshu-1703/uber-demand-prediction
Notes - https://drive.google.com/file/d/1Lvo2GxEESL2KFWHCLMZULd4YnYQAxF6s/view
Dataset - https://www.kaggle.com/datasets/elemento/nyc-yellow-taxi-trip-data?select=yellow_tripdata_2016-03.csv

Session 8 - Building the Streamlit Application¶

Description

Notes - https://drive.google.com/file/d/119HV3A7UcCKBQmJ0Dpm26pJb5UCdaEd9/view?usp=sharing
Code - https://github.com/Himanshu-1703/uber-demand-prediction
Dataset - https://www.kaggle.com/datasets/elemento/nyc-yellow-taxi-trip-data?select=yellow_tripdata_2016-03.csv

Session 9 - Building the CI Pipeline¶

Description

Notes - https://drive.google.com/file/d/1ywmObgr0TbDg71lAINFSUZP1Ixhlz2fQ/view?usp=sharing
Code - https://github.com/Himanshu-1703/uber-demand-prediction
Dataset - https://www.kaggle.com/datasets/elemento/nyc-yellow-taxi-trip-data?select=yellow_tripdata_2016-03.csv

Session 10 - Deployment using AWS CodeDeploy¶

Description

Code - https://github.com/Himanshu-1703/uber-demand-prediction
Notes - https://drive.google.com/file/d/10Q3uZdd1egIeC29JMEqTQbLMcjNtcV4I/view?usp=sharing
Dataset - https://www.kaggle.com/datasets/elemento/nyc-yellow-taxi-trip-data?select=yellow_tripdata_2016-03.csv

Capstone Project 4 | Hybrid Recommender System¶

Session 1 - Project Overview¶

Description

Notes - https://drive.google.com/file/d/1ACfEnspTRsdEMbliNxkYjDev9KEpWHN3/view?usp=drive_link

drive.google.com

Session 2 - EDA¶

Description

Code- https://colab.research.google.com/drive/1wSrwGaPHt9NyJ5nz2Evna1hYZVbykAKo?usp=sharing
Notes - https://drive.google.com/file/d/1-SH_voCRjAFclT91FB1IOgjsHwkQDaCj/view?usp=drive_link

Session 3 (Part 1) - Content Based Recommender System¶

Description

Code - https://colab.research.google.com/drive/1NSqfgdkOL4Ztnp47-VIcTpJ0EPlRhElN?usp=sharing
Notes - https://drive.google.com/file/d/1-X7BSJORF_jOtMYL8da8McDoaB4X9DXE/view?usp=drive_link

Session 3 (Part 2) - Content Based Recommender System¶

Description

Code - https://github.com/Himanshu-1703/spotify-hybrid-recommender-system

github.com

Session 4 (Part 1) - Collaborative Filtering Based Recommender System¶

Description

Notes - https://drive.google.com/file/d/108gCpnCQDjnh6nKfTkugf5UsLvH-1nir/view?usp=drive_link
Code - https://colab.research.google.com/drive/1c5TIgUu4xeqHBnELH41U6qU9418UYux7?usp=sharing

Session 4 (Part 2) - Collaborative Filtering Based Recommender System¶

Description

Code - https://github.com/Himanshu-1703/spotify-hybrid-recommender-system

github.com

Session 5 - Building the Hybrid Recommender System¶

Description

Notes - https://drive.google.com/file/d/10D7SoZgk2nQ2rTQORdSZVVvvpDlRn6cm/view?usp=drive_link
Code - https://github.com/Himanshu-1703/spotify-hybrid-recommender-system

Session 6 - Improving Hybrid Recommender System¶

Description

Notes - https://drive.google.com/file/d/10DoSf0SRw2Be_RohcF_LU06CW7Cm_3dq/view?usp=drive_link
Code - https://github.com/Himanshu-1703/spotify-hybrid-recommender-system

Session 7 - DVC Pipeline & CI¶

Description

Notes - https://drive.google.com/file/d/10owhy_FY_ioZPM5LLz_TKMIRMyZtecra/view?usp=drive_link
Code - https://github.com/Himanshu-1703/spotify-hybrid-recommender-system

Session 8 - Dockers and CD¶

Description

Notes - https://drive.google.com/file/d/1192mlHMESug16v9PptW5QFhLTK8lTSBb/view?usp=drive_link
Code - https://github.com/Himanshu-1703/spotify-hybrid-recommender-system

Session 9 - Deployment on AWS¶

Description

Notes - https://drive.google.com/file/d/11AeHwazS9TivCwthex48HVJ0Lgt3FYkw/view?usp=drive_link
Code - https://github.com/Himanshu-1703/spotify-hybrid-recommender-system
Command
Commands to install Docker and AWS CLI V2 on EC2 instances.
# Update the package lists

sudo apt-get update -y
# Install Docker

sudo apt-get install -y docker.io
# Start and enable Docker service

sudo systemctl start docker

sudo systemctl enable docker
# Add 'ubuntu' user to the 'docker' group to run Docker commands without 'sudo'

sudo usermod -aG docker ubuntu
# Install necessary utilities

sudo apt-get install -y unzip curl
# Download and install AWS CLI

curl "https://awscli.amazonaws.com/awscli-exe-linux-x86_64.zip" -o "/home/ubuntu/awscliv2.zip"

unzip -o /home/ubuntu/awscliv2.zip -d /home/ubuntu/

sudo /home/ubuntu/aws/install
# Clean up the AWS CLI installation files

rm -rf /home/ubuntu/awscliv2.zip /home/ubuntu/aws
---------------------------------------------------------------------------------------------------------------------------------------
Docker Commands
# login and authenticate

aws ecr get-login-password --region ap-south-1 | docker login --username AWS --password-stdin 891377050051.dkr.ecr.ap-south-1.amazonaws.com
# pull the docker image

docker pull 891377050051.dkr.ecr.ap-south-1.amazonaws.com/spotify_hybrid_recsys:latest
# change image tags

docker tag 891377050051.dkr.ecr.ap-south-1.amazonaws.com/spotify_hybrid_recsys:latest spotify_hybrid_recsys:latest 
# Run the container

docker run -d --name hybrid_recsys -p 8000:8000 spotify_hybrid_recsys:latest

Session 10 (Part 1) - Blue Green Deployment Phase 1¶

Description

Code - https://github.com/Himanshu-1703/spotify-hybrid-recommender-system
Notes - https://drive.google.com/file/d/12CN3bd2V2ZdoZe_RB3YxLwunFpAv2aiq/view?usp=drive_link
Commands
Install CodeDeploy Agent on each instance using the Launch Template
#!/bin/bash
# update packages

sudo apt update -y
# install ruby required for code deploy

sudo apt install ruby-full -y
# get additional packages

sudo apt install wget -y

cd /home/ubuntu
# import the agent

wget https://aws-codedeploy-ap-south-1.s3.ap-south-1.amazonaws.com/latest/install
# install the agent

chmod +x ./install

sudo ./install auto
# run the agent

sudo systemctl start codedeploy-agent

Session 10 (Part 2) - Blue Green Deployment Phase 2¶

Description

Notes - https://drive.google.com/file/d/12CN3bd2V2ZdoZe_RB3YxLwunFpAv2aiq/view?usp=drive_link
Code - https://github.com/Himanshu-1703/spotify-hybrid-recommender-system

Capstone Project 3 | Swiggy Delivery Time Prediction¶

Session 1 - Understanding the Problem Statement¶

Description

Note: If you experience any rendering issues with the notes on Github, please download them to your system and open them locally in any IDE.
Updated Resources - https://github.com/Himanshu-1703/swiggy-delivery-time-prediction

github.com

Session 2 - Data Cleaning¶

Description

Note: If you experience any rendering issues with the notes on Github, please download them to your system and open them locally in any IDE.
Updated Resources - https://github.com/Himanshu-1703/swiggy-delivery-time-prediction
 
Notes - https://drive.google.com/file/d/11NK6LBoPQjtXok82OT322UjDhlU5VUYa/view?usp=drive_link
Code - https://colab.research.google.com/drive/1POTfVyjYOgFAGwnwDnf8VdCkCw3wX0Uc?usp=sharing
Dataset - https://drive.google.com/file/d/11Na0FCU5grOqY-vyKfy7T4Ge1IUdWMIN/view?usp=sharing

Session 3 - EDA¶

Description

Note: If you experience any rendering issues with the notes on Github, please download them to your system and open them locally in any IDE.
Updated Resources - https://github.com/Himanshu-1703/swiggy-delivery-time-prediction
 
Code:
https://drive.google.com/file/d/11mbsmgD4h4MdGy0lIY1gbrkK1wv76R4R/view?usp=sharing
https://drive.google.com/file/d/11bsljQCrSwE4XETGnV2oboxLhafa0rxa/view?usp=drive_link
Dataset - https://drive.google.com/file/d/11Na0FCU5grOqY-vyKfy7T4Ge1IUdWMIN/view?usp=drive_link
Notes - https://drive.google.com/file/d/11fgYT0uHWt9rKhiPxHO__hwv9RfMX4ZR/view?usp=drive_link

Session 4 - Building a baseline model¶

Description

Note: If you experience any rendering issues with the notes on Github, please download them to your system and open them locally in any IDE.
Updated Resources - https://github.com/Himanshu-1703/swiggy-delivery-time-prediction
 
Code - https://drive.google.com/file/d/11bsljQCrSwE4XETGnV2oboxLhafa0rxa/view
https://colab.research.google.com/drive/12bB6Q1Ea3P_HEt20n-bXudGxeViPDgqg?usp=sharing
Dataset - https://drive.google.com/file/d/11Na0FCU5grOqY-vyKfy7T4Ge1IUdWMIN/view
Notes - https://drive.google.com/file/d/12-ySN8WSXcCIsgOp0M2lDFDb7fTR0bq6/view

Session 5 (Part 1) - Experimentation Part 1¶

Description

Note: If you experience any rendering issues with the notes on Github, please download them to your system and open them locally in any IDE.
Updated Resources - https://github.com/Himanshu-1703/swiggy-delivery-time-prediction
Code:
https://colab.research.google.com/drive/1tAW-Lz0km1DmXZaNnqykEtoDCIi66fX1?usp=sharing
https://colab.research.google.com/drive/1lUUwCHqX7pdAiro8G4PmLVfLiSjS7_m9?usp=sharing
https://colab.research.google.com/drive/19WpmGfOWxHHT0r-7mOQOnobSf9zCfuPP?usp=sharing
https://drive.google.com/file/d/11bsljQCrSwE4XETGnV2oboxLhafa0rxa/view?usp=drive_link
https://colab.research.google.com/drive/1hdbgTC8eCxQkaZDwt0oetF48YmmgyjqB?usp=sharing
https://colab.research.google.com/drive/1M8R39bCnlLPkeXdaWwYMNDhscz6ggBCr?usp=sharing
https://colab.research.google.com/drive/1iEBmzGKBUQY8s2BIy2J88WOQDFAjgcvS?usp=sharing
Notes - https://drive.google.com/file/d/12ZCxUOCpaQvJimpSbNni9rEDi436kPRz/view?usp=drive_link
Dataset - https://drive.google.com/file/d/11Na0FCU5grOqY-vyKfy7T4Ge1IUdWMIN/view?usp=drive_link
Github Repo - https://github.com/Himanshu-1703/swiggy-delivery-time-prediction
 
 

Session 5 (Part 2) - Experimentation Part 2¶

Description

Note: If you experience any rendering issues with the notes on Github, please download them to your system and open them locally in any IDE.
Updated Resources - https://github.com/Himanshu-1703/swiggy-delivery-time-prediction
 
Code - https://colab.research.google.com/drive/11tMj_IBDiVHd4c2gpLpRSttQvi9obID1?usp=sharing

Session 6 - Building the DVC Pipeline Workflow¶

Description

Github Repo - https://github.com/Himanshu-1703/swiggy-delivery-time-prediction
Dataset - https://drive.google.com/file/d/11Na0FCU5grOqY-vyKfy7T4Ge1IUdWMIN/view?usp=drive_link
Notes - https://drive.google.com/file/d/14jZjSoSRcYvI1QHqkI_l2c2ZD63Mu4Zs/view?usp=sharing

Session 7 - Model Registry and Building the API¶

Description

Notes - https://drive.google.com/file/d/151g0TwnHxRnBWxMnJMVa0Hn0qQvERFrm/view?usp=drive_link
Dataset - https://drive.google.com/file/d/11Na0FCU5grOqY-vyKfy7T4Ge1IUdWMIN/view?usp=drive_link
Github repo - https://github.com/Himanshu-1703/swiggy-delivery-time-prediction

Session 8 (Part 1) - Model Testing¶

Description

Notes - https://drive.google.com/file/d/154umLi_fSLDuEuuI8zZCgTsciNIKgqnd/view
Dataset - https://drive.google.com/file/d/11Na0FCU5grOqY-vyKfy7T4Ge1IUdWMIN/view
Repo - https://github.com/Himanshu-1703/swiggy-delivery-time-prediction

Session 8 (Part 2) - CI¶

Description

Notes - https://drive.google.com/file/d/154umLi_fSLDuEuuI8zZCgTsciNIKgqnd/view
Dataset - https://drive.google.com/file/d/11Na0FCU5grOqY-vyKfy7T4Ge1IUdWMIN/view
Repo - https://github.com/Himanshu-1703/swiggy-delivery-time-prediction

Session 9 - Dockers and CD¶

Description

Dataset - https://drive.google.com/file/d/11Na0FCU5grOqY-vyKfy7T4Ge1IUdWMIN/view
Repo - https://github.com/Himanshu-1703/swiggy-delivery-time-prediction

Session 10 (Part 1) - Deployment on AWS¶

Description

Notes - https://drive.google.com/file/d/176kdNIdhHERq3Jzh_KRjZZNOBr3cKdv6/view

drive.google.com

Session 10 (Part 2) - Deployment on AWS¶

Description

Steps - https://drive.google.com/file/d/179Rm0_IG-6u99VjMtKruWk4fCmM_jl67/view?usp=sharing
Github Repo - https://github.com/Himanshu-1703/swiggy-delivery-time-prediction
Dataset - https://drive.google.com/file/d/11Na0FCU5grOqY-vyKfy7T4Ge1IUdWMIN/view?usp=drive_link

Capstone Project 2 | YouTube Comment Analysis¶

Session 1 - Project Planning¶

Description

Notes - https://drive.google.com/file/d/1TgIpAhdltm8OVoMY7IAk__2tieAOCAOX/view?usp=sharing

drive.google.com

Session 2 - Preprocessing & EDA¶

Description

Code - https://colab.research.google.com/drive/1yM7Tgkc63jzpu9Bpbd0TUBbstGyeG4El?usp=sharing

colab.research.google.com

Session 3 - Building a Baseline Model¶

Description

Code - https://colab.research.google.com/drive/1P0ljSEbL2blaPdBilgUp9XullOk4rBgV?usp=sharing

colab.research.google.com

Session 4 (Part 1) - Improving the Baseline Model¶

Description

Codes
https://colab.research.google.com/drive/1bKQGgNJG3N6zHe6fZYdobUBH9BirI9xY?usp=sharing
https://colab.research.google.com/drive/1jhrNYo7a523RK3X7Kz4fGG-UfEoap4I_?usp=sharing

colab.research.google.com
colab.research.google.com{ target="blank" title="https://colab.research.google.com/drive/1jhrNYo7a523RK3X7Kz4fGG-UfEoap4I?usp=sharing" }

Session 4 (Part 2) - Improving the Baseline Model¶

Description

Codes:
https://colab.research.google.com/drive/1Xve9ljMn3VMse9C5cl4UYsW07bOOfv_8?usp=sharing
https://colab.research.google.com/drive/1aoOAt8lBKG9uri0XZSoz-6_ucyYEHWpM?usp=sharing
https://colab.research.google.com/drive/1ZAWzrxxWTj9zgyjfsmaFWSpYl4VoU0O1?usp=sharing
https://colab.research.google.com/drive/13-tbBUHakidGX32wVlOi0ZfMDBxffn1k?usp=sharing
https://colab.research.google.com/drive/1Y446h7pdn0C_2ITM-N6oI89fdt90RjJT?usp=sharing
https://colab.research.google.com/drive/1WJLt7d95WOmVLwXEZl61_1EuzVW_AjVi?usp=sharing
https://colab.research.google.com/drive/1tjqqb-rLyrFOioEqc4vgyfhhfYEEKjKv?usp=sharing
https://colab.research.google.com/drive/14mIeAQT3HcxF_1BbhAUahpmNV8CJw1vH?usp=sharing
https://colab.research.google.com/drive/1dyifRVHZ-rO0rXD60elgRIGYatwO2MN0?usp=sharing

Session 5 - Improving the LightGBM model¶

Description

Codes:
https://colab.research.google.com/drive/1mCRrndK4qzadoGMCe3SZE9BPK4lIdwzb?usp=sharing
https://colab.research.google.com/drive/1ZwgHCBrYwdHKf_tJwMRj_HOGmiE5BmM_?usp=sharing
https://colab.research.google.com/drive/1nD4a0oL-8hM3G8rBYqtSlzKBIszhXIlJ?usp=sharing
https://colab.research.google.com/drive/1_7abUn-Ffd22Zux27-yovOzNk6KJscZt?usp=sharing
 

colab.research.google.com{ target="blank" title="https://colab.research.google.com/drive/1ZwgHCBrYwdHKf_tJwMRj_HOGmiE5BmM?usp=sharing" }
colab.research.google.com
colab.research.google.com
colab.research.google.com

Session 6 - Building the DVC Pipeline¶

Description

Code - 
https://www.kaggle.com/code/sampsuman/sentiment-analysis-bert-reddit-data
https://github.com/campusx-official/yt-comment-sentiment-analysis/tree/master/src/model

Session 7 - Adding Model Registry¶

Description

Code 
https://github.com/campusx-official/yt-comment-sentiment-analysis/

Session 8 - Building the Chrome Plugin Part 1¶

Description

Code 
https://github.com/campusx-official/yt-chrome-plugin-frontend
https://github.com/campusx-official/yt-comment-sentiment-analysis

Session 9 - Building the Chrome Plugin Part 2¶

Description

Notes - https://drive.google.com/file/d/1q7JLwuivBowIcaKMDdWqKX1mYuG9ikc4/view?usp=sharing
Code 
https://github.com/campusx-official/yt-chrome-plugin-frontend
https://github.com/campusx-official/yt-comment-sentiment-analysis
 

Session 10 (Part 1) - Adding the CI workflow¶

Description

Code - https://github.com/campusx-official/yt-comment-sentiment-analysis

github.com

Session 10 (Part 2) - Adding the CI Workflow¶

Description

Code - https://github.com/campusx-official/yt-comment-sentiment-analysis

github.com

Session 11 - Dockerization¶

Description

Code - https://github.com/campusx-official/yt-comment-sentiment-analysis

github.com

Session 12 - Deployment¶

Description

Codes:
https://github.com/campusx-official/yt-comment-sentiment-analysis
https://github.com/campusx-official/yt-chrome-plugin-frontend
- create a new launch template

    - create a new IAM role - EC2_Codedeploy_role -> policy - AmazonEC2RoleforAWSCodeDeploy

    - create a new IAM role - EC2_ECR_role -> policy - AmazonEC2ContainerRegistryReadOnly

    - install and codedeploy agent in using User data
********************************************************************************************

#!/bin/bash
# Update the package list

sudo apt-get update -y
# Install Ruby (required by the CodeDeploy agent)

sudo apt-get install ruby -y
# Download the CodeDeploy agent installer from the correct region

wget https://aws-codedeploy-ap-southeast-2.s3.ap-southeast-2.amazonaws.com/latest/install
# Make the installer executable

chmod +x ./install
# Install the CodeDeploy agent

sudo ./install auto
# Start the CodeDeploy agent

sudo service codedeploy-agent start
**********************************************************************************************

- create a new ASG using the above launch template
- check if codedeploy agent is running - sudo service codedeploy-agent status
- create a new codedeploy application
- create a deployment group -> Service role -> CodeDeployServiceRole -> AWSCodeDeployRole
- create appspec.yml deploy/scripts/install_dependencies.sh and start_docker.sh
- update CICD.yaml
- create S3 bucket
- create new deployment
- monitor the deployment
- check the docker application -> test in postman
- check the chrome plugin by editing the url

Extra Sessions¶

Interview Questions on Statistics¶

Description

Feedback Form  - https://forms.gle/5vGSrcVs1LrKd8Mo9

forms.gle

Model Explainability¶

Description

Feedback Form - https://forms.gle/14bjmVMSL1GUFPXy5

forms.gle

Interview Questions on Regression¶

Description

Feedback Form - https://forms.gle/GFbzkKyStkFPn6t9A

forms.gle

How to Solve a Banking Problem using ML¶

Description

Feedback Form - https://forms.gle/2E94oLoMsWwrysvB8

forms.gle

Session on ResNET Paper Discussion¶

Description

Feedback Form - https://forms.gle/7p2wCSjoC7UDRox99

forms.gle

Introduction to PyTorch¶

Description

Feedback Form - https://forms.gle/SeGGzLa1pUZWvTvP9

forms.gle

Named Entity Recognition using NLTK & Spacy¶

Description

Feedback Form - https://forms.gle/M1KszATvuEgXWJW57

forms.gle

Latent Dirichlet Allocation (LDA)¶

Description

Feedback Form - https://forms.gle/EQk2F3mg1Hgqjgyx7

forms.gle

Introduction to PowerBI¶

Description

Feedback Form - https://forms.gle/8YMKA71gS9bos6Ns8

forms.gle

Anomaly Detection¶

Description

Feedback Form - https://forms.gle/2Ck2kFvSWqRptiaR6

forms.gle

Prompt Engineering¶

Description

Feedback Form - https://forms.gle/wYXG79C48X5WLdqe7
Notebook - https://colab.research.google.com/drive/1ukpKSCDsY8vmgdB12oTMTol7YCrbls9Q?usp=sharing

Interview Questions on Tree Based Models¶

Description

Feedback Form - https://forms.gle/naq7hYB4wr9B1jCq8

forms.gle

Multioutput and Multiclass Classification Problem¶

Description

Feedback Form - https://forms.gle/UPQVWEQ7vzWPh3PP8

forms.gle

EKYC Using Computer Vision¶

Description

Feedback Form - https://forms.gle/GAyVY3hqZK4eLK3D7

forms.gle

Time Series Forecasting¶

Description

Feedback Form - https://forms.gle/cKyz2gLPLDNsFRv49

forms.gle

A/B Testing¶

Description

Feedback Form - https://forms.gle/qXj3NkuJGMVDvEY86

forms.gle

Langchain¶

Description

Feedback Form - https://forms.gle/mtLS482RCXEfGu1R6

forms.gle

FastAPI¶

Description

Feedback Form - https://forms.gle/DP2xVdUxscsAcoPm8

forms.gle

Vertex AI¶

Description

Feedback - https://forms.gle/dvWxrHCHNccbHi6p9

forms.gle

RAG¶

Description

Feedback - https://forms.gle/NdHpDDgw9beYZm786

forms.gle

MLOps Revisited¶

Session 1 MLOps Revisited - Introduction to MLOps¶

Description

Notes - https://drive.google.com/file/d/1SN5eHzhvefu4PJmVQ1Qp6ADTO6KBQd5S/view?usp=sharing

drive.google.com

Session 2 on MLOps Revisited - MLOps Tools Stack¶

Description

Notes - https://drive.google.com/file/d/15eH2h74B7bSsb2J38HCAMTIzEboDqVLm/view?usp=sharing

drive.google.com

Prometheus Part-3¶

Description

Resources - https://github.com/Himanshu-1703/prometheus-test-target

github.com

Monitoring on AWS¶

Description

Resources - 

https://github.com/Himanshu-1703/model-monitoring-demo.git

github.com

Monitoring on ASG¶

Description

Resources - 

https://github.com/Himanshu-1703/monitoring-sentiment-analysis-project.git

github.com

Alerting on AWS (Complete)¶

Description

Resources - https://github.com/Himanshu-1703/monitoring-sentiment-analysis-project.git

github.com

Model Retraining Implementation¶

Description

Resources - 

https://github.com/Himanshu-1703/monitoring-sentiment-analysis-project.git

github.com

Interview Questions¶

Session 1 on Interview Questions on Statistics¶

Description

Please give feedback
https://forms.gle/GyJRLaF61S3p3RYM6

forms.gle

Session on Project Based Interview Questions¶

Description

Feedback - https://forms.gle/Z6eJ2KGW8PD6m3kq6

forms.gle

Session 1 on ML Interview Questions¶

Description

Feedback - https://forms.gle/BuHb3bKNuiDEKq54A

forms.gle

Recording - Session 3 on ML Interview Questions¶

Description

Feedback Form - https://forms.gle/grsYYRzmG8YasmCQ7

forms.gle

Miscellaneous Topics¶

Session 1 on Imbalanced Data - Introduction¶

Description

Code - https://colab.research.google.com/drive/1om2tWp4mXsMFlIz6hx91cxEE8gGPgnb8?usp=sharing
Notes - https://drive.google.com/file/d/1YlemM7FSzMiAiIVcb7WXvlckcM60U6k8/view?usp=drive_link

Session 2 on Imbalanced Data - Oversampling Techniques¶

Description

Code - https://colab.research.google.com/drive/1hzraH16KcNGnfGM_qw_UOvB2lFH4E7f-?usp=sharing
Notes - https://drive.google.com/file/d/1dgEFWsxp02my40-aNlx_VdAvsMtALt8u/view?usp=sharing

Session 3 on Imbalanced Data - Undersampling Techniques¶

Description

Code - https://colab.research.google.com/drive/10Bb-292f7_eYdGTjzSEtmT_6Gzp1LvDr?usp=sharing

colab.research.google.com

Other Boosting Frameworks¶

Session 1 on Introduction to LightGBM¶

Description

Code - https://colab.research.google.com/drive/1ZeqDKHFrgS96K1guNXKINEuVXRg-8LxL?usp=sharing

colab.research.google.com

Session 2 on LightGBM (GOSS & EFB)¶

Description

Research Paper - https://proceedings.neurips.cc/paper_files/paper/2017/file/6449f44a102fde848669bdd9eb6b76fa-Paper.pdf

proceedings.neurips.cc

Session 1 on CatBoost - Practical Introduction¶

Description

Code - https://colab.research.google.com/drive/1EK3EDWkl0V6bpJ0AXO1IhYNCyZalUPPj?usp=sharing

colab.research.google.com

Advanced XGBoost¶

Session on XGBoost Regularization¶

Description

Code - https://colab.research.google.com/drive/1LA93NSEyfLhjwMTVHy5o7sbU97mfhLdt?usp=sharing
Notes - https://drive.google.com/file/d/1yvsTgKBmqgwnAswr15lagQe8d9UJGnlE/view?usp=sharing

Session 2 on XGBoost Regularization¶

Description

Code - https://colab.research.google.com/drive/1LA93NSEyfLhjwMTVHy5o7sbU97mfhLdt?usp=sharing
Notes - https://drive.google.com/file/d/1yvsTgKBmqgwnAswr15lagQe8d9UJGnlE/view?usp=sharing
 

Session on XGBoost Optimizations¶

Description

Notes - https://drive.google.com/file/d/1P3B0X6Vahs6NNxovCG8cdH2ETARMkXym/view?usp=sharing
Code - https://colab.research.google.com/drive/1yLdcPwZMFTcz7xLlgGfULbw3sj_UgG2g?usp=sharing

How XGBoost Handles Missing Values¶

Description

Code - https://colab.research.google.com/drive/17qgja8XHnjCLIXTOdxSEwcSBwQdR4Y-Q?usp=sharing

colab.research.google.com

Feature Engineering¶

Session 1 on Encoding Categorical Features¶

Description

https://colab.research.google.com/drive/1PIZQpOTZgXMpQUQ0SUT51RpfvH-jVSFi?usp=sharing
Dataset 1 - https://drive.google.com/file/d/1B0YNqPgjTat67SAc5nIpeNSDWRJQRz9e/view?usp=sharing
Dataset 2 - https://drive.google.com/file/d/1a9kmZni3NJqEP2-7v4oHbPMr9UgoUnnQ/view?usp=sharing
Notes - https://drive.google.com/file/d/1yueCF-CJU7p8lag9GTAumiuVE4pDOXXu/view?usp=sharing

Session on Sklearn ColumnTransformer & Pipeline¶

Description

Code - https://colab.research.google.com/drive/1M5NhzUn5QjXQKfdFWpbZwUBrT0bCAZpY?usp=sharing

colab.research.google.com

Session on Sklearn Deep Dive¶

Description

Code - https://colab.research.google.com/drive/1loDInKlSYJffsLo0HoZ9TZuJ87RCk07j?usp=sharing
Notes - https://drive.google.com/file/d/1BQQXt-l7_rQ9SJlc7my77GyRk_fuIlYM/view?usp=sharing

Session 2 on Encoding Categorical Features¶

Description

Code - https://colab.research.google.com/drive/1PIZQpOTZgXMpQUQ0SUT51RpfvH-jVSFi?usp=sharing
Notes - https://drive.google.com/file/d/1bkrsPkLBm2LGyh1auF1eZmP5SQ5BgfOW/view?usp=sharing

Session 1 on Discretization¶

Description

Code - https://colab.research.google.com/drive/1LINzV5JO6pHHuYdfL7JzGZAL2F6TI0bU?usp=sharing

colab.research.google.com

Session 2 on Discretization¶

Description

https://colab.research.google.com/drive/1HnjOqgejNIuTLH4UycmiUrUaR8AWKUUv?usp=sharing
https://scikit-learn.org/stable/auto_examples/preprocessing/plot_discretization_strategies.html#sphx-glr-auto-examples-preprocessing-plot-discretization-strategies-py
 
https://drive.google.com/file/d/15qXxo_w6YXqjwJzR0Yfc7ux0DBXuU7dR/view?usp=sharing

Session 1 on Handling Missing Data¶

Description

Code - https://colab.research.google.com/drive/1ssz2Gd2YrwYFdPu824K0ypNMWouiigNH?usp=sharing
https://github.com/ResidentMario/missingno
Notes - https://drive.google.com/file/d/1HD5G6u54Yc5K3oY6pW0wJKtD18ccJEda/view?usp=sharing

Session 2 on Handling Missing Data¶

Description

Code - https://colab.research.google.com/drive/1ssz2Gd2YrwYFdPu824K0ypNMWouiigNH?usp=sharing
Code - https://colab.research.google.com/github/campusx-official/100-days-of-machine-learning/blob/main/day38-missing-indicator/automatically-select-imputer-parameters.ipynb
Notes - https://drive.google.com/file/d/1tPv7bp3QpAgBGmKf6LNHeQtq0v1r3vpp/view?usp=sharing
 

Session 3 on Handling Missing Values¶

Description

Code - https://colab.research.google.com/drive/1XEb6W8Fwy7dXhGKSfUGeMBwSz9W8fI9N?usp=sharing
https://colab.research.google.com/github/campusx-official/100-days-of-machine-learning/blob/main/day40-iterative-imputer/step-by-step.ipynb
https://colab.research.google.com/drive/1WpPutTwFqxw-nONiqZStqEO6CH0Proo4?usp=sharing
Notes - https://drive.google.com/file/d/1Qgayi8feSqj2E4kZvLyGu3ENu51qXN6p/view?usp=sharing

Session on Feature Scaling¶

Description

Code - https://colab.research.google.com/drive/1pWB7wzkizH_6p9bhZqhE2K6R5CKiEa2V?usp=sharing

colab.research.google.com

Session 2 on Feature Scaling¶

Description

Code - https://colab.research.google.com/drive/1hnN2EccC5lQpV33yXO0HHFwdY4F2QsdM?usp=sharing
Notes - https://drive.google.com/file/d/1OMUgHQTrsxZK6etU09KImUotRTbdpvhx/view?usp=sharing

Session 1 on Outlier Detection¶

Description

Code - https://colab.research.google.com/github/campusx-official/100-days-of-machine-learning/blob/main/day42-outlier-removal-using-zscore/day42.ipynb#scrollTo=SW3_GmfYRdZ2
https://colab.research.google.com/github/campusx-official/100-days-of-machine-learning/blob/main/day43-outlier-removal-using-iqr-method/day43.ipynb#scrollTo=Y_YfE9gfRdEF

Session 2 on Outlier Detection¶

Description

Code - https://colab.research.google.com/drive/1mhBMpq2q_8i0VRr0TGpFBm3IFcCdFt0n?usp=sharing

colab.research.google.com

Session 3 on Outlier Detection¶

Description

Notes - https://drive.google.com/file/d/1B2-KJxcrh5s640D_BXGrXjP_uU-cCpIY/view?usp=drive_link
Code - https://colab.research.google.com/drive/1mhBMpq2q_8i0VRr0TGpFBm3IFcCdFt0n?usp=sharing

Session on Feature Transformation¶

Description

Codes - https://colab.research.google.com/github/campusx-official/100-days-of-machine-learning/blob/main/day30-function-transformer/day30.ipynb
https://colab.research.google.com/drive/1uIM7IPLlqCZUndpynUuYo1be2qG52Hcx?usp=sharing
 
Notes - https://drive.google.com/file/d/1azMCEj2D2PSHhbPjgxlfYep4I7tS5BvW/view?usp=sharing

Unsupervised Machine Learning¶

Session on DBSCAN¶

Description

Code - https://colab.research.google.com/drive/1yZ0bg0cK6X84u6rawgRgoPzlgAmhVy2E?usp=sharing
Visualization - https://www.naftaliharris.com/blog/visualizing-dbscan-clustering/
Notes - https://drive.google.com/file/d/1plNghtufMn7uj1cQTQVRzy4Ss8xiXOE9/view?usp=sharing

Session on Hierarchical Clustering¶

Description

Code - https://colab.research.google.com/github/campusx-official/agglomerative-hierarchical-clustering-demo/blob/main/agglomerative-clustering.ipynb
Dataset - https://www.kaggle.com/datasets/rohan0301/unsupervised-learning-on-country-data/code?datasetId=721951&sortBy=voteCount
Notes - https://drive.google.com/file/d/1kTc7WPCXgVZywaHfyFwKQA9NTvZA9Wy9/view?usp=sharing

Session on Gaussian Mixture Models¶

Description

Code - https://colab.research.google.com/drive/1pT8C8wSzCC3NmfIQaXBiAJrcsxxTYnF1?usp=sharing

colab.research.google.com

Session 2 on Gaussian Mixture Models¶

Description

Notes - https://drive.google.com/file/d/1BFwJJfbSPQaf5R8WBWv7QtrVtBQd6SeN/view?usp=sharing
Code - https://colab.research.google.com/drive/1pT8C8wSzCC3NmfIQaXBiAJrcsxxTYnF1?usp=sharing

Session on T-SNE¶

Description

Code - https://colab.research.google.com/drive/1N2kGH2U73JkMbD_OPp4H0QgHE066GlwG?usp=sharing
Blog 1 - https://distill.pub/2016/misread-tsne/
Blog 2 - https://colah.github.io/posts/2014-10-Visualizing-MNIST/
Notes - https://drive.google.com/file/d/1FqmADKeMxbrbH2oPGW-m9wZ5fcfkoPWF/view?usp=sharing
Research Paper - https://www.jmlr.org/papers/volume9/vandermaaten08a/vandermaaten08a.pdf

Session 2 on T-SNE¶

Description

Notes - https://drive.google.com/file/d/16txXFIzU1MUOmnqVkCDgjUzue4VSTqWZ/view?usp=sharing
Code - https://colab.research.google.com/drive/1N2kGH2U73JkMbD_OPp4H0QgHE066GlwG?usp=sharing

KMeans Clustering¶

Session 1 on K Means Clustering¶

Description

Code - https://colab.research.google.com/drive/18CynErsHaQ_BanYv0ruq2mFF9Hncmu2V?usp=sharing
Notes - https://drive.google.com/file/d/1Am4db3eUVhw_wpPG-R_dqiewZxoB5xRn/view?usp=sharing

Session 2 on KMeans Clustering¶

Description

Code - https://colab.research.google.com/drive/18CynErsHaQ_BanYv0ruq2mFF9Hncmu2V?usp=sharing
Assignment - https://www.kaggle.com/code/campusx/ipl-kmeans-clustering
Notes - https://drive.google.com/file/d/11rBoavT2eGWzElwEkNrgh4PpoZ2MP-mF/view?usp=sharing
Research Papers
https://citeseerx.ist.psu.edu/document?repid=rep1&type=pdf&doi=b452a856a3e3d4d37b1de837996aa6813bedfdcf
https://www.cse.iitd.ac.in/~rjaiswal/2015/col870/Project/Nipun.pdf
https://theory.stanford.edu/~sergei/papers/kMeansPP-soda.pdf

Session 3 on KMeans Clustering¶

Description

Code - https://colab.research.google.com/drive/1j5fLdvQU5-phpm8L5Su6ydGrXbeUMG8a?usp=sharing
Task Dataset - https://www.kaggle.com/datasets/elemento/nyc-yellow-taxi-trip-data?select=yellow_tripdata_2015-01.csv
Research Paper - https://citeseerx.ist.psu.edu/document?repid=rep1&type=pdf&doi=b452a856a3e3d4d37b1de837996aa6813bedfdcf
Notes - https://drive.google.com/file/d/1sUtu0DOa9DEIuIA41gRarE5Qb4utXyWm/view?usp=sharing

K-Means Clustering Algorithm From Scratch In Python¶

Description

Code - https://github.com/campusx-official/100-days-of-machine-learning/tree/main/kmeans

github.com

MiniBatch KMeans Task Solution¶

Description

Code - https://www.kaggle.com/code/sampsuman/minibatchclustering-nyc-yellowtaxi

www.kaggle.com

MLOps¶

Session 1 on MLOPs - Introduction to MLOps¶

Description

Webpage - https://drive.google.com/file/d/1Qh09qB1hAWM2ViFnuQRPG1TSLqdum4Yp/view?usp=sharing
Notebook - https://drive.google.com/file/d/1PWFkoj_o4akglbGVlxnSQYar6xOZiAmv/view?usp=sharing

Session 2 on MLOps - Version Control¶

Description

ML Pipeline Code - https://drive.google.com/file/d/1CK9TajkCSjZRGmHh32LkOvFv_mSrpR9q/view?usp=sharing
Week 1 notebook - https://drive.google.com/file/d/1PWFkoj_o4akglbGVlxnSQYar6xOZiAmv/view?usp=sharing
 

Session 3 on MLOps - Reproducibility¶

Description

Notebook - https://drive.google.com/file/d/1ShYun3a-n1RNcvO_AD659O-0rsD8WSM-/view?usp=sharing

drive.google.com

Session 4 on MLOps - Data Version Control (DVC)¶

Description

Code - https://drive.google.com/file/d/1RUHrttSmQu0a1RGvFXw3OKyYmx0YVwH-/view?usp=sharing

drive.google.com

Session 5 on MLOps - ML Pipelines and Experimentation Tracking¶

Description

Notebook - https://drive.google.com/file/d/1niffrZBNnPbYIMFkCcFVjsrZFeuP7kQC/view?usp=sharing

drive.google.com

Session 6 on MLOps¶

Description

Notebook - https://drive.google.com/file/d/1niffrZBNnPbYIMFkCcFVjsrZFeuP7kQC/view?usp=sharing

drive.google.com

Session 7 on MLOps | Continuous Integration¶

Description

Code - https://drive.google.com/file/d/1niffrZBNnPbYIMFkCcFVjsrZFeuP7kQC/view?usp=sharing

drive.google.com

Session 8 on MLOps - Dockers¶

Description

Code - https://drive.google.com/file/d/1niffrZBNnPbYIMFkCcFVjsrZFeuP7kQC/view?usp=sharing

drive.google.com

Session 9 on MLOPs - Continuous Deployment¶

Description

Code - https://drive.google.com/file/d/1qYUEKjCAy19llr5wlAnK76Bfn2FhjeW2/view?usp=sharing

drive.google.com

Session 10 on MLOps - Introduction to AWS¶

Description

Code - https://drive.google.com/file/d/1Sh1zsLvbZex0WnAYqVpaVAtTv9MRHO34/view?usp=drive_link

drive.google.com

Session 12 on MLOps - Distributed Infrastructure¶

Description

Code - https://drive.google.com/file/d/1Iwf4RXf0Ap48V3JTZqU1c4TWjuFSlerC/view?usp=sharing

drive.google.com

Session 13 on MLOps - Kubernetes Internals¶

Description

Notebook - https://drive.google.com/file/d/1Iwf4RXf0Ap48V3JTZqU1c4TWjuFSlerC/view?usp=sharing

drive.google.com

Session 14 on MLOps - Deployment on Kubernetes¶

Description

Notebook - https://drive.google.com/file/d/1Iwf4RXf0Ap48V3JTZqU1c4TWjuFSlerC/view?usp=sharing

drive.google.com

Session 15 on MLOps - Seldon Deployments¶

Description

Notebook - https://drive.google.com/file/d/1FJYzGxTc-sg3-GMc35cjSr7CxUhXSoPp/view?usp=sharing

drive.google.com

Session 16 on MLOps - Monitoring & Alerting¶

Description

Notebook - https://drive.google.com/file/d/1JnkgIkiNFUXlow_uF82-jiZ61cl329xW/view?usp=sharing

drive.google.com

Session 17 on Rollout & Rollback Strategies¶

Description

Notebook - https://drive.google.com/file/d/14kTAejx1lj_B9JaupF6kFSXO51KraYJj/view?usp=sharing

drive.google.com

Session on MLOps Interview Questions¶

Description

Notebook - https://drive.google.com/file/d/1C41fY26JDCk8QUGHGt9pKGLdD7XBwdsz/view?usp=sharing

drive.google.com

Session 18 on MLOps - ML Technical Debt¶

Description

Notebook - https://drive.google.com/file/d/1C41fY26JDCk8QUGHGt9pKGLdD7XBwdsz/view?usp=sharing

drive.google.com

XGBoost¶

Introduction to XGBoost | XGBoost Part 1¶

Description

Paper - https://youtu.be/C6aDw4y8qJ0
Notes - https://drive.google.com/file/d/1ytqdl3DZBPMcyOKLjSuy_NEYQgJMr9zE/view?usp=sharing

XGBoost for Regression | XGBoost Part 2¶

Description

Notes - https://drive.google.com/file/d/17uGCjrDNfTLF7lYloXgIKDmkrlpBZyuI/view?usp=sharing

drive.google.com

XGBoost For Classification | XGBoost Part 3¶

Description

Notes - https://drive.google.com/file/d/18It-0bZiSdjTDONkfmBUXqdbRHn3ZUss/view?usp=sharing

drive.google.com

The Complete Maths of XGBoost | XGBoost Part 3¶

Description

Notes - https://drive.google.com/file/d/11Qx05XC6vUNvG5LvalWKSlNwY2knuNcr/view?usp=sharing

drive.google.com

Capstone Project¶

Session 1 on Capstone Project | Data Gathering¶

Description

Datasets
https://docs.google.com/spreadsheets/d/1mFNBKFgwFnCXvRsLps5FbsPt_WNnFYyOvBGwSFHZHRU/edit?usp=sharing
https://docs.google.com/spreadsheets/d/19Uw-4uktVEQKFzVHTRkd0DMJ4v3lQJsiCH63PlHwQdw/edit?usp=sharing
https://docs.google.com/spreadsheets/d/1z55UOBr3nfFYf5JXkCAcTGameKrOm2a0irRfjKrpdSs/edit?usp=sharing
https://docs.google.com/spreadsheets/d/1FzCcUbzBKG78snWFg3tAAD4E1sIjCClD9cfsA2DWAjg/edit?usp=sharing
Web Scraping Codes
Flats/Appartments : - https://colab.research.google.com/drive/1bKT92iRVecazQcc3eJmpZH7HZyCLk-oO?usp=sharing
https://colab.research.google.com/drive/1IclV7RVZSVNe3fo5WapspW6uo9uWLTU5?usp=sharing
https://colab.research.google.com/drive/1cmJ9xbSvErNXnfVP0xVBpcvf2hRz3fmp?usp=sharing
 
Notebook PDF : https://drive.google.com/file/d/179HLl-HQVoAFUKcGUtUujW72T1QJpcCJ/view?usp=sharing

Session 2 on Capstone Project | Data Cleaning¶

Description

Code - https://github.com/campusx-official/dsmp-capstone-project
Onenote PDF : https://drive.google.com/file/d/1AsiyoyiMttez-Y35bJkD7sXiPVtcm5ub/view?usp=sharing

Session 3 on Capstone Project | Feature Engineering¶

Description

Code - https://github.com/campusx-official/dsmp-capstone-project

Onenote PDF : https://drive.google.com/file/d/1_HsADKK-wsHBOPPIhNoxxTB6n_TioKRM/view?usp=sharing

Session 4 on Capstone Project | EDA¶

Description

Code : https://github.com/campusx-official/dsmp-capstone-project
NOtebook PDF : https://drive.google.com/file/d/1M_nBre0L50HtiqQxW791syyJVvSOaJWG/view?usp=sharing

Session 5 on Capstone Project | Outlier Detection and Removal¶

Description

Code - https://github.com/campusx-official/dsmp-capstone-project
Notebook PDF Session 4 Onwards: https://drive.google.com/file/d/1PS-M1pWgfU_wMKDJNQq1iNNSBb_iRsuG/view?usp=sharing

Session 6 on Capstone Project | Missing Value Imputation¶

Description

Code - https://github.com/campusx-official/dsmp-capstone-project
Notebook PDF Session 4 Onwards: https://drive.google.com/file/d/1PS-M1pWgfU_wMKDJNQq1iNNSBb_iRsuG/view?usp=sharing

Session 7 on Capstone Project | Feature Selection¶

Description

Code - https://github.com/campusx-official/dsmp-capstone-project
Notebook PDF Session 4 Onwards: https://drive.google.com/file/d/1PS-M1pWgfU_wMKDJNQq1iNNSBb_iRsuG/view?usp=sharing

Session 8 on Capstone Project | Model Selection & Productionalization¶

Description

https://github.com/campusx-official/dsmp-capstone-project
Website Code - https://github.com/campusx-official/real-estate-app
Notebook PDF Session 4 Onwards: https://drive.google.com/file/d/1PS-M1pWgfU_wMKDJNQq1iNNSBb_iRsuG/view?usp=sharing

Session 9 on Capstone Project | Building the Analytics Module¶

Description

https://github.com/campusx-official/real-estate-app
https://github.com/campusx-official/dsmp-capstone-project
Notebook PDF Session 4 Onwards: https://drive.google.com/file/d/1PS-M1pWgfU_wMKDJNQq1iNNSBb_iRsuG/view?usp=sharing

Session 10 on Capstone Project | Building the Recommender System¶

Description

Code - https://github.com/campusx-official/dsmp-capstone-project
Apartment Data : https://colab.research.google.com/drive/1Ms-86hbsFojEG_0lXdeI5wiFzm5T0xUM?usp=sharing
Notebook PDF Session 4 Onwards: https://drive.google.com/file/d/1PS-M1pWgfU_wMKDJNQq1iNNSBb_iRsuG/view?usp=sharing

Session 11 on Capstone Project | Building the Recommender System Part 2¶

Description

Code - https://github.com/campusx-official/real-estate-app/blob/master/pages/3_Recommend%20Appartments.py
Notebook PDF Session 4 Onwards: https://drive.google.com/file/d/1PS-M1pWgfU_wMKDJNQq1iNNSBb_iRsuG/view?usp=sharing

Session 12 on Capstone Project | Building the Insights Module¶

Description

Code - https://github.com/campusx-official/dsmp-capstone-project/blob/master/insights-module.ipynb

github.com

Session 13 on Capstone Project | Deploying the application on AWS¶

Description

Blog link - https://learnwith.campusx.in/blog/deploying-a-streamlit-app-on-aws-ec2

learnwith.campusx.in

Week 36 - Gradient Boosting¶

Session 1 on Gradient Boosting for Regression¶

Description

https://colab.research.google.com/drive/1GSeBSO22uMbCvlN10x0mxtBbx9Ap2nqM?usp=sharing
https://colab.research.google.com/github/campusx-official/100-days-of-machine-learning/blob/main/gradient-boosting/gradient_boost_step_by_step.ipynb
Notebook PDF : https://drive.google.com/file/d/1-MO6IasyTlk2jWX61SPuQ_4I1MsLJ0-a/view?usp=sharing

Session 2 on Gradient Boosting | Perspectives¶

Description

Notebook PDF : https://drive.google.com/file/d/1wIBrSzZhbf1DSkqlzXHtBXBt-Oc1YmhC/view?usp=sharing
Blog : https://explained.ai/gradient-boosting/L2-loss.html
Paper: https://jerryfriedman.su.domains/ftp/trebst.pdf

Gradient Boosting for Classification Part 1¶

Description

Code - https://colab.research.google.com/drive/15G44KBuSgHs7hdSIuhwh-LR7Qur4xRyD?usp=sharing
Notebook PDF : https://drive.google.com/file/d/1i6AsjCJtfilbhNxkG39kmQsJNRtuMwKq/view?usp=sharing

Gradient Boosting for Classification | Geometric Intuition¶

Description

Blog Link - https://towardsdatascience.com/all-you-need-to-know-about-gradient-boosting-algorithm-part-2-classification-d3ed8f56541e
Codes - https://colab.research.google.com/drive/13p46IFhg3h6BIdjxUcfXPco13jIOCV6I?usp=sharing

Gradient Boosting Classification | Maths Formulation¶

Description

Notebook PDF : https://drive.google.com/file/d/1i6AsjCJtfilbhNxkG39kmQsJNRtuMwKq/view?usp=sharing

drive.google.com

Week 35 - Random Forest¶

Bagging | Introduction | Part 1¶

Description

https://github.com/campusx-official/bagging-ensemble

github.com

Bagging Ensemble | Part 2 | Bagging Classifiers¶

Description

https://github.com/campusx-official/bagging-ensemble

github.com

Bagging Ensemble | Part 3 | Bagging Regressor¶

Description

https://github.com/campusx-official/bagging-ensemble

github.com

Session 1 on Random Forest¶

Description

https://colab.research.google.com/github/campusx-official/100-days-of-machine-learning/blob/main/day65-random-forest/rf_learning_tool.ipynb
https://colab.research.google.com/github/campusx-official/100-days-of-machine-learning/blob/main/day65-random-forest/random_forest_demo.ipynb
https://colab.research.google.com/github/campusx-official/100-days-of-machine-learning/blob/main/day65-random-forest/feature-importance-in-sklearn.ipynb
https://colab.research.google.com/github/campusx-official/100-days-of-machine-learning/blob/main/day65-random-forest/bagging_vs_random_forest.ipynb
Notebook PDF : https://drive.google.com/file/d/1EK3u-hyEiK0wMMsy07fI0Dn-jwOicbEU/view?usp=sharing
 

Session 2 on Random Forest¶

Description

https://colab.research.google.com/drive/1_9MoZF1Vxa5AKGo0VNBK8qE1qL45pmua?usp=sharing
Notebook PDF : https://drive.google.com/file/d/1sV_PG8PNCzXNvN73cvSxw3NDiwh1Y7bc/view?usp=sharing
 

Week 34 - Decision Trees¶

Session 1 on Decision Trees¶

Description

Code - https://colab.research.google.com/drive/10nSMmbHuaehgd7TdgcTVsS8isPq8bJqY?usp=sharing
Notebook PDF : https://drive.google.com/file/d/1EaPkxw5I0KstZARjQMq_J8yOQYt82Hb0/view?usp=sharing

Session 2 on Decision Trees¶

Description

Code - https://colab.research.google.com/drive/10nSMmbHuaehgd7TdgcTVsS8isPq8bJqY?usp=sharing
Notebook PDF : https://drive.google.com/file/d/1yzsKHIMMeOztAK1Jcg5dy9OvVAJjMMun/view?usp=sharing

Session 3 on Decision Trees | Pruning¶

Description

Code - https://colab.research.google.com/drive/10nSMmbHuaehgd7TdgcTVsS8isPq8bJqY?usp=sharing
Notebook PDF : https://drive.google.com/file/d/1yU-tkH1KMGu7pXHsHmPz5I94NNw7W2wj/view?usp=sharing

Awesome Decision Tree Visualization using dtreeviz library¶

Description

Code - https://github.com/campusx-official/dtreeviz-demo

github.com

Week 33 - Support Vector Machines (SVM)¶

SVM Part 1 - Hard Margin SVM¶

Description

Code - https://colab.research.google.com/github/campusx-official/Support-Vector-Machines-SVM-/blob/master/SVM%20Demo.ipynb

Notebook PDF : https://drive.google.com/file/d/10-cpW7zDF91m1I4Otb2o9h4MVLnQ7CaP/view?usp=sharing

SVM Part 2 | Soft Margin SVM¶

Description

Code - https://colab.research.google.com/github/campusx-official/Support-Vector-Machines-SVM-/blob/master/SVM%20Demo.ipynb
Notebook PDF : https://drive.google.com/file/d/1M2fdMaMp8fKHZRf4HTkpx7OpdrIOQXeO/view?usp=sharing

Session on Constrained Optimization Problem¶

Description

Code - https://colab.research.google.com/drive/1QYO4i5H3amkUyFEYxyVcv1u79Y4x4bVh?usp=sharing
Code - https://colab.research.google.com/github/campusx-official/Support-Vector-Machines-SVM-/blob/master/Kernel%20Trick%20SVM.ipynb
Notebook PDF : https://drive.google.com/file/d/1t0iKw5AoVcxFmlKldNTjuteFiWEMQe5n/view?usp=sharing

Session on SVM Dual Problem¶

Description

Notebook PDF : https://drive.google.com/file/d/1eN3a76-ZS8IEBMc4kCwRaDDKeofQd9Ip/view?usp=sharing

drive.google.com

Session on Maths Behind SVM Kernels¶

Description

Notebook PDF : https://drive.google.com/file/d/1yn8tnyqx284XWQqtSFhKJmaKhci9pdR3/view?usp=sharing
Code: https://colab.research.google.com/drive/1NCmy1ivmanc_fN0IwqQDWO1Z_X_QToBs?usp=sharing

Week 32 - Logistic Regression¶

Session 1 on Logistic Regression¶

Description

Notebook PDF : https://drive.google.com/file/d/1L2AY78TgqOlf7rlsRDZhQABAThqsRQRz/view?usp=sharing

drive.google.com

Session on Multiclass Classification using Logistic Regression¶

Description

Code - https://colab.research.google.com/github/campusx-official/100-days-of-machine-learning/blob/main/day60-logistic-regression-contd/softmax-demo.ipynb
 
Notebook PDF : https://drive.google.com/file/d/1neEFkUXbXx5RktYKoPe2qW8CV460G_rX/view?usp=sharing

Session on Maximum Likelihood Estimation¶

Description

Notebook PDF : https://drive.google.com/file/d/1rWU7rM-aQBN4YyFyq-iJZ8d8dVar7HlU/view?usp=drive_link

drive.google.com

Session 3 on Logistic Regression¶

Description

Code - https://colab.research.google.com/drive/14yneTfvrXQLC_drOPCme1lWKMnxLGN5d?usp=sharing
https://colab.research.google.com/github/campusx-official/100-days-of-machine-learning/blob/main/day60-logistic-regression-contd/polynomial-logistic-regression.ipynb

Notebook PDF :  https://drive.google.com/file/d/1A4KTY5onEMazPXLnRxQiJOMBnyfZoGj6/view?usp=sharing

Logistic Regression Hyperparameters¶

Description

Code - https://github.com/campusx-official/100-days-of-machine-learning/tree/main/day60-logistic-regression-contd

This link contains all notebook PDFs used in 100 Days of ML Playlist.

https://lnkd.in/gxv947TB

Week 31 - Naive Bayes¶

Crash Course on Probability Part 1¶

Description

Notebook PDF : https://drive.google.com/file/d/1tH8hrTpU3SG-b6v3ecWxwSu1FRENeP6S/view?usp=sharing

drive.google.com

Crash Course on Probability Part 2¶

Description

Code - https://colab.research.google.com/drive/1q0yJ-6pTLkXyETwS41uFf5ggOagxydvt?usp=sharing
Notebook PDF Probability part - 2: https://drive.google.com/file/d/1ygZT5izsUcp8AsmmqDjj8CcP_5WpKnFP/view?usp=sharing

Session 1 on Naive Bayes¶

Description

Code - https://colab.research.google.com/drive/1lbqkDb-3TQn4xKu3yUzMeS8tjgZLwd4k?usp=sharing
https://www.kaggle.com/campusx/sentiment-analysis-using-naive-bayes
PDF: https://drive.google.com/file/d/1UqadGJVXFZEPD4YOUAZ2t15mJCpJYghS/view?usp=sharing

Session 2 on Naive Bayes¶

Description

Notebook PDF : https://drive.google.com/file/d/1ocyl8UZi3qS-ARaw7_Ot9qUGLhA20f9n/view?usp=sharing

drive.google.com

Session 3 on Naive Bayes¶

Description

Notebook PDF :  https://drive.google.com/file/d/13DrpcVdFMZJv9G6GghF6Wpghr_WykVMW/view?usp=sharing

drive.google.com

Email Spam Classifier | End to End Project¶

Description

Code - https://github.com/campusx-official/sms-spam-classifier
Dataset - https://www.kaggle.com/datasets/uciml/sms-spam-collection-dataset

Week 30 - Model Evaluation and Selection¶

ROC Curve in Machine Learning¶

Description

Code - https://colab.research.google.com/drive/1nPPKRtmtoVf-GNEpJVAHmEQhXNwXOlXV?usp=sharing
Notebook PDF : https://drive.google.com/file/d/1Ql1vwGgDfPy7b5nXQPqtRltpyx1NUGHT/view?usp=sharing

Session on Cross Validation¶

Description

Code - https://colab.research.google.com/drive/1p6b3Wlt7r8gRK4OhHUrGMDQqpWRM66x9?usp=sharing
Notebook PDF : https://drive.google.com/file/d/1Xye1BmIcFBl9aNp_B4Dkmx2bHRtwyqWx/view?usp=sharing

Session on Data Leakage¶

Description

Code - https://colab.research.google.com/drive/1p6b3Wlt7r8gRK4OhHUrGMDQqpWRM66x9?usp=sharing
Notebook PDF : https://drive.google.com/file/d/1EPB0JfGo6orB4ndEy5A5UroLKtNBL4XU/view?usp=sharing

Session on Hyperparameter Tuning¶

Description

Code - https://colab.research.google.com/drive/1AWo5sjSZw3EgehYnuNLTPoPO08g8yU_O?usp=sharing
Notebook Pdf : https://drive.google.com/file/d/1_LpP5xdrnZn842FyxdAkI5oGCS9t5TQU/view?usp=sharing

Week 29 - PCA¶

PCA Part 3 | Code Example and Visualization¶

Description

Code - https://github.com/campusx-official/100-days-of-machine-learning/tree/main/day47-pca

github.com

Task - 57 (PCA)¶

Assignment Link

Session on Eigen Vectors and Eigen Values¶

Description

Notebook PDF : https://drive.google.com/file/d/1uuHklMvFbMlTRBhosis4OFfaK-FHGAyr/view?usp=sharing

drive.google.com

Session on Eigen Decomposition + PCA Variants¶

Description

Code - https://colab.research.google.com/drive/1AsPuskXViToTGHu6377GgWRbtyjdDLli?usp=sharing
PDF: https://drive.google.com/file/d/1X414H0nFrWy96Qh2qXadL_b01j5aVn8O/view?usp=sharing

Session on Singular Value Decomposition¶

Description

Code - https://colab.research.google.com/drive/1JiPh_ImSfbclRpwCTnMYz5yh8fGR6wC6?usp=sharing
PDF: https://drive.google.com/file/d/1qNIEcxK8Cq9CFPfJ47eVbb3ht6pPN67e/view?usp=sharing

Week 28 - K Nearest Neighbors¶

Session 1 on K-Nearest Neighbors¶

Description

 
Code - https://www.kaggle.com/code/campusx/knn-on-breast-cancer-dataset
PDF : https://drive.google.com/file/d/1nGa-6YW7SISVroOd8GAY_kzSARERSZjL/view?usp=share_link

Coding K Nearest Neighbors from Scratch¶

Description

Code - https://github.com/campusx-official/knn-from-scratch

github.com

How to draw Decision Boundary for classification algorithms¶

Description

Code - https://github.com/campusx-official/K-Nearest-Neighbors

github.com

Session on Advanced KNN¶

Description

PDF : https://drive.google.com/file/d/10i4EYgzngNIVVkeOBwdPWZpo8Nqj2w_S/view?usp=share_link
Visualizer Code : https://github.com/samp-suman/KNN-Visualizer

Task - 56 (KNN)¶

Assignment Link

Classification Metrics Part 1 | Accuracy and Confusion Matrix | Type 1 and Type 2 Errors¶

Description

Code - https://github.com/campusx-official/100-days-of-machine-learning/tree/main/day59-classification-metrics

Accuracy is not reliable measure in case of imbalanced data : https://colab.research.google.com/drive/12hFmzcyfQetk5MUlTMNrrl1VCoYDGNfT?usp=sharing

Classification Metrics Part 2 | Precision, Recall and F1 Score¶

Description

Code - https://github.com/campusx-official/100-days-of-machine-learning/tree/main/day59-classification-metrics

github.com

Week 27 - Regularization¶

Regularization Part 1 | Bias Variance Trade-off¶

Description

PDF - https://drive.google.com/file/d/17YPjz8n3M1upSAWVQSmKvqZ8OXiHUQeX/view?usp=sharing

drive.google.com

Regularization Part 2 | What is Regularization | Paid Zoom Session | 19th May¶

Description

Code - https://colab.research.google.com/drive/1fco0kYZaU8wPbfSaX4DHPNI_vZwB48AN?usp=sharing
PDF - https://drive.google.com/file/d/17YPjz8n3M1upSAWVQSmKvqZ8OXiHUQeX/view?usp=share_link

Ridge Regression Part 1 | Geometric Intuition and Code | Regularized Linear Models¶

Description

Code - https://github.com/campusx-official/100-days-of-machine-learning/tree/main/day55-regularized-linear-models
100 Days of ML Notes : https://drive.google.com/file/d/1MENZBaet8y7PHjLbn8NpwitUZpIOFpCd/view

Ridge Regression Part 2 | Mathematical Formulation & Code from scratch | Regularized Linear Models¶

Description

Code - https://github.com/campusx-official/100-days-of-machine-learning/tree/main/day55-regularized-linear-models
100 Days of ML Notes : https://drive.google.com/file/d/1MENZBaet8y7PHjLbn8NpwitUZpIOFpCd/view

Ridge Regression Part 3 | Gradient Descent | Regularized Linear Models¶

Description

Code - https://github.com/campusx-official/100-days-of-machine-learning/tree/main/day55-regularized-linear-models
 
100 Days of ML Notes : https://drive.google.com/file/d/1MENZBaet8y7PHjLbn8NpwitUZpIOFpCd/view

Ridge Regression Part 4 | 5 Key Points | Regularized Linear Models¶

Description

Code - https://github.com/campusx-official/100-days-of-machine-learning/tree/main/day55-regularized-linear-models
 
100 Days of ML Notes : https://drive.google.com/file/d/1MENZBaet8y7PHjLbn8NpwitUZpIOFpCd/view

Lasso Regression | Intuition and Code Sample | Regularized Linear Models¶

Description

Code - https://github.com/campusx-official/100-days-of-machine-learning/tree/main/day56-lasso-regression
100 Days of ML Notes : https://drive.google.com/file/d/1MENZBaet8y7PHjLbn8NpwitUZpIOFpCd/view

Why Lasso Regression creates sparsity?¶

Description

Code - 

Task-Regularisation¶

Assignment Link

ElasticNet Regression | Intuition and Code Example | Regularized Linear Models¶

Description

https://github.com/campusx-official/100-days-of-machine-learning/tree/main/day57-elasticnet-regression

github.com

Week 26 - Feature Selection¶

Session 54 - Feature Selection Part 1 | Filter Methods¶

Description

Code - https://www.kaggle.com/campusx/filter-based-feature-selection
Notes - https://drive.google.com/file/d/1CZPRLuSBmyPw7ftQVIyS7sreoemJGGn-/view?usp=sharing

Task 54¶

Assignment Link

Session 55 - Feature Selection Part 2 | Wrapper Methods¶

Description

Code - https://colab.research.google.com/drive/1asS5x04z2La1uAqpkHkz4NVvySWKZ7V6?usp=sharing
Notes - https://drive.google.com/file/d/1l0D_YMFkVM8z3I3INn6ou2aCxrHSoeGK/view?usp=sharing

Task 55¶

Assignment Link

Session 3 on Feature Selection | Embedded Methods¶

Description

Code - https://colab.research.google.com/drive/11RD671YkUNJsDaQm06kSNXpCdeDZE5US?usp=sharing
PDF - https://drive.google.com/file/d/1WnIiMTWZA7ZBAhzYsjNTACinrTgH7pUl/view?usp=share_link

Week 25 - Regression Analysis¶

Session 1 on Regression Analysis¶

Description

Code - https://colab.research.google.com/drive/1kHcIPKnRaJeBjVkEv3e6DIpzVQdqW2Iy?usp=sharing

colab.research.google.com

Session 2 on Regression Analysis¶

Description

Code - https://colab.research.google.com/drive/1kHcIPKnRaJeBjVkEv3e6DIpzVQdqW2Iy?usp=sharing
PDF - https://drive.google.com/file/d/1iBcf8Ma5Ueqc0ZIvQeFFltKSVWqBKYoX/view?usp=sharing
 

Polynomial Regression¶

Description

Code - https://github.com/campusx-official/100-days-of-machine-learning/tree/main/day53-polynomial-regression

github.com

Session on Assumptions of Linear Regression¶

Description

Code - https://colab.research.google.com/drive/10NHeEW_xeuOHloDWgrb6U2osJAbUOxSD?usp=sharing
PDF - https://drive.google.com/file/d/1-hJ51wKAYguczZ44WxdpblNrZNy7VaWY/view?usp=share_link

Session 53 - Session on Multicollinearity¶

Description

Code - https://colab.research.google.com/drive/1bCaiuxiKShosnKhaqDAE1lWXAi-_Fbb2?usp=sharing
PDF - https://drive.google.com/file/d/131sJUu335FSldQmdP4qTbp4qdSG4jYN8/view?usp=sharing

Task 53¶

Assignment Link

Week 24 - Gradient Descent¶

Session 51 - Gradient Descent From Scratch¶

Description

 
Code used: https://github.com/campusx-official/100-days-of-machine-learning/tree/main/day51-gradient-descent
Google Tool used: https://developers.google.com/machine-learning/crash-course/fitter/graph
PDF of 100 Days of ML : https://drive.google.com/file/d/1MENZBaet8y7PHjLbn8NpwitUZpIOFpCd/view

Session 52 (Part 1) - Batch Gradient Descent¶

Description

Code : https://github.com/campusx-official/100-days-of-machine-learning/tree/main/day52-types-of-gradient-descent

Session 52 (Part 2) - Stochastic Gradient Descent¶

Description

Code used : https://github.com/campusx-official/100-days-of-machine-learning/tree/main/day52-types-of-gradient-descent

github.com

Session 52 (Part 3) - Mini-Batch Gradient Descent¶

Description

Code used : https://github.com/campusx-official/100-days-of-machine-learning/tree/main/day52-types-of-gradient-descent

github.com

Week 23 - Linear Regression¶

Session 48 - Introduction to Machine Learning¶

Description

PDF: https://drive.google.com/file/d/1N_9GqQjWSWzX4BDXUJK-q4FwuEVuEKtn/view

drive.google.com

Session 49 - Simple Linear Regression¶

Description

https://github.com/campusx-official/100-days-of-machine-learning/tree/main/day48-simple-linear-regression
https://github.com/campusx-official/100-days-of-machine-learning/tree/main/day49-regression-metrics
PDF: https://drive.google.com/file/d/18oSjN8aEztz_m-_CoKb5i_kGHvKccjdp/view?usp=share_link

Session 50 - Multiple Linear Regression¶

Description

https://colab.research.google.com/github/campusx-official/100-days-of-machine-learning/blob/main/day50-multiple-linear-regression/multiple_linear_regression.ipynb#scrollTo=NpAvnU-t3yV0
 
https://colab.research.google.com/github/campusx-official/100-days-of-machine-learning/blob/main/day50-multiple-linear-regression/code-from-scratch.ipynb#scrollTo=afc9a715
PDF: https://drive.google.com/file/d/1fYGa7wXCirq8Tvo2YqfHsQSlhs1DXXwo/view?usp=share_link

Session on Optimization The Big Picture¶

Description

PDF: https://drive.google.com/file/d/11jO22cqMyGcmvunqJ_nkRdrSgcJkmJR1/view?usp=share_link

drive.google.com

Session on Differential Calculus¶

Description

PDF: https://drive.google.com/file/d/18whooe9qIj32ypW3LKKgatJDisO8AtKn/view?usp=share_link

drive.google.com

Week 22 - Linear Algebra¶

Linear Algebra - Part 1 | Vectors¶

Description

Code - https://colab.research.google.com/drive/1WfshZ-tT4XOFAuYK6dvQaEvsqpesELq0?usp=sharing
Notebook PDF : https://drive.google.com/file/d/1E3Q2_omNtvE6sydWTahYsysEcyVQn8EG/view?usp=share_link

Task 47¶

Assignment Link

Linear Algebra Part 2 | Matrices (Computation)¶

Description

Notebook PDF : https://drive.google.com/file/d/1yZvM8rpKVLHCKjYCanu4tFktabumIeZv/view?usp=share_link

drive.google.com

Linear Algebra Part 3 | Matrices (Intuition)¶

Description

Tools
https://campusx-official-matrix-linear-transformation-viz-linear-x7jwva.streamlit.app/
https://campusx-official-matrix-linear-transformation-v-multiply-5a32p1.streamlit.app/
https://campusx-official-matrix-linear-transformatio-determinant-96ncsg.streamlit.app/
 
PDF: https://drive.google.com/file/d/1XCIpJ-vPuuMniSipujE4GNXXOuVK1W6y/view?usp=share_link

Week 21 - Hypothesis Testing¶

Session 45 - Hypothesis Testing Part 1¶

Description

Session 45 PDF Notebook: https://drive.google.com/file/d/1J6TWERqWu1-98n2b8uBKdU8j0aCVgyuN/view?usp=share_link

drive.google.com

Session 46 - Hypothesis Testing Part 2 | p-values | t-tests¶

Description

Code - https://colab.research.google.com/drive/1W2ts8cTUwnAQL47QHZI3iWoE6KJz4bHf?usp=sharing
https://www.kaggle.com/campusx/titanic-single-sample-t-test
https://www.kaggle.com/campusx/titanic-2-sample-t-test
Notebook PDF Link: https://drive.google.com/file/d/17rN645-blGEO59vA6Jkvieh-IbFeOP8t/view?usp=share_link

Task 46¶

Assignment Link

Session on Chi Square Tests¶

Description

Code - https://colab.research.google.com/drive/113wsZhFcUDa-QnOeY3KmIBsJToyzE80e?usp=sharing
PDF - https://drive.google.com/file/d/1nmxMlse95CE0it612u55s9hudL2Xef4K/view?usp=share_link
 
@TimeStamp 02:01:00 :



Formlua for Chi-Square is : (Observed - Expected)^2 / Expected, not   (Observed - Expected)^2 / Observed
Calculation would go like : 
(15-12)^2 / 12 + (20-19)^2 / 19 + ... + (40 - 12)^2 / 12

Session on ANOVA¶

Description

Code - https://colab.research.google.com/drive/1RKIB0EqIc1kWJNlwYqMfUEohqVqq-yaa?usp=sharing
PDF - https://drive.google.com/file/d/1gQAeNqaL2sBFSpAtONhtn9oBzWFh7c6h/view?usp=share_link

Week 20 - Inferential Statistics¶

Session 43 - Central Limit Theorem¶

Description

Code - https://colab.research.google.com/drive/1W--4mte3uaDD8rReLAt4OWEoLXrb5Ij7?usp=sharing
Dataset Link: https://www.kaggle.com/campusx/titanic-clt
Session 43 Notebook PDF : https://drive.google.com/file/d/14izghg0Aw5v3ed1rq9xEHZVcG_x0hMD-/view?usp=share_link

Task 43¶

Assignment Link

Session 44 - Confidence Intervals¶

Description

Session 44 Notebook PDF : https://drive.google.com/file/d/1nskWHtR1ePmrje76k71gdUc2-fcVWvMH/view?usp=share_link

drive.google.com

Task 44¶

Assignment Link

Week 19 - Probability Distributions¶

Session 41 - Normal Distribution¶

Description

Code - https://colab.research.google.com/drive/1N_T0_w5vpT1k1Z4pSf4IMhAxYT1nRKLU?usp=sharing
Viz tool - https://samp-suman-normal-dist-visualize-app-lkntug.streamlit.app/
Z-table - https://www.ztable.net/

Notebook Pdf : https://drive.google.com/file/d/11V7c5D80UDteolb_DVgS0em72pfMB-i0/view?usp=share_link

Task 41¶

Assignment Link

Session 42 - Non-Gaussian Probability Distributions¶

Description

Code - https://colab.research.google.com/drive/1Q2ug8BXogFqYY_6e04dmHk0Tn5_yfObo?usp=sharing https://github.com/campusx-official/100-days-of-machine-learning/tree/main/day30-function-transformer https://github.com/campusx-official/100-days-of-machine-learning/tree/main/day31-power-transformer
Session 42 Notebook PDF: https://drive.google.com/file/d/1sd4nz8PNsGc334ng86V8uKvJNqhWyRZ2/view?usp=share_link

Session on Views and User Defined Functions in SQL¶

Description

Session Notebook PDF (View and User Defined Function): https://drive.google.com/file/d/1NxvHiK-NJBIAzKMfFwf2ibgDRzYadqpS/view?usp=share_link

drive.google.com

Session on Transactions & Stored Procedures¶

Description

Session Notebook PDF : https://drive.google.com/file/d/1EbU6gv0xLmllHRvnpHkp4HEmqHGDt7qx/view?usp=share_link

drive.google.com

Week 18 - Descriptive Statistics Contd.¶

Session 39 - Descriptive Statistics Part 2¶

Description

Code - https://colab.research.google.com/drive/19YlpW_N7idyQQvmpgrZg8KNSvIjCPk-8?usp=sharing
Session-39 Notebook: https://drive.google.com/file/d/1edN9LSbMP3lPh4YMem4n9K0Y6lSeFaP1/view?usp=share_link

Session 40 - Probability Distribution Functions - PDF, PMF & CDF¶

Description

Code - session-40.ipynb - Colaboratory (google.com)
Session 40 Notes : https://drive.google.com/file/d/1FQ65CTmMLK-PYZ6NT9txGcGmJHobtNYl/view?usp=share_link

Task 40¶

Assignment Link

SQL Datetime Case Study on Flights dataset¶

Description

Code - https://docs.google.com/document/d/1g67XZ96yhIz6mqfzXJRhVvZXP26VVbYE4YX51Ck4c84/edit?usp=sharing
Dateset - https://docs.google.com/spreadsheets/d/13_PAiduepzVBMU-WYp_10NBMfMH12A9D2KgzieOtk1o/edit?usp=sharing
EDA question Pdf : https://drive.google.com/file/d/1DPA__10bpvte9wtvgLqehpZ9w31HZm2g/view?usp=share_link

For Q6: During Updation you would be getting error, like same as sir got warnings : Truncate invalid double value '5m', This is coming because of row no 5975.
Updated Query :

UPDATE flights

SET duration_mins = 

CASE

        WHEN duration LIKE '%h %m' THEN

            SUBSTRING_INDEX(duration, 'h', 1) * 60 +

            SUBSTRING_INDEX(SUBSTRING_INDEX(duration, ' ', -1), 'm', 1)

        WHEN duration LIKE '%h' THEN

            SUBSTRING_INDEX(duration, 'h', 1) * 60

        WHEN duration LIKE '%m' THEN

            SUBSTRING_INDEX(duration, 'm', 1)

END ;

Session on Database Design | SQL Data Types | Database Normalization¶

Description

Session On Database Design Notebook Pdf :  https://drive.google.com/file/d/1sQj7UJrSX_Y74qWgmez84Pv3oH6z6TCg/view?usp=share_link

drive.google.com

Week 17 - Descriptive Statistics¶

Session 38 - Descriptive Statistics Part 1¶

Description

Session-38 Notebook: https://drive.google.com/file/d/1da0Wj1KyUxLtGVnFcPTpgWvKQsWjZly-/view?usp=share_link

drive.google.com

Session on Datetime in SQL¶

Description

Code - https://docs.google.com/document/d/1Izh0o3ZTsVcSw5ZHsX5uB7v7IGxJ7hbX7a3VfIuFv1c/edit?usp=sharing
PDF - https://drive.google.com/file/d/11s40kbk56ZaA56f7c34SEGmZmy-_GNOa/view?usp=share_link
Documentation - https://dev.mysql.com/doc/refman/8.0/en/date-and-time-functions.html#function_date-format

Task 36¶

Assignment Link

Week 16 - Advanced SQL¶

Task 36 Solutions¶

Description

Code - https://docs.google.com/document/d/1rJXV6c-qoTwYL91zA-ajrveHjpx6ldvA2ZQFQcqEICY/edit?usp=sharing

docs.google.com

Career Pe Charcha - Markdown Basics + How to improve Github Profile¶

Description

Code - https://colab.research.google.com/drive/1-v-RUuQlaVUKiVvvEjpsiK2sw-BovjlB?usp=sharing
Links - https://github-readme-streak-stats.herokuapp.com/?user=campusx-official
https://github-readme-stats.vercel.app/api/top-langs/?username=campusx-official
https://github-readme-stats.vercel.app/api?username=campusx-official

Session 37 - Window Functions Part 2¶

Description

Code - https://docs.google.com/document/d/1PyAU4tBcBxUR5Vn4GEZngPJXkKArAHqUBZgCCu1fFk4/edit?usp=sharing
Datasets: https://drive.google.com/drive/folders/1N3WzcWpwiYwxobFIlNn9sOHVRZ-tpBdc?usp=share_link
Window Functions Part-1 Pdf : https://drive.google.com/file/d/12P7vW2VBq0_4Nm3j1aQDB599HOy0OGtk/view?usp=share_link
Window Functions Part-2&3 Pdf : https://drive.google.com/file/d/1pTPslw_dOMwkK06Cu-lcHX5NzRtmNPW5/view?usp=share_link

Session 37 - Window Functions Part 3¶

Description

Window Functions Part-1 Pdf : https://drive.google.com/file/d/12P7vW2VBq0_4Nm3j1aQDB599HOy0OGtk/view?usp=share_link

Window Functions Part-2&3 Pdf : https://drive.google.com/file/d/1pTPslw_dOMwkK06Cu-lcHX5NzRtmNPW5/view?usp=share_link
 
Timestamp 19:30 : percentile_disc and percentile_cont

These functtions are not there in MySQL(InoDB) (Workbench default server).

In the sessions I have connected Xampp MySQL server with workbench.

Task 37¶

Assignment Link

Session on Data Cleaning using SQL | Laptops Dataset¶

Description

Code - https://docs.google.com/document/d/1_urkFSBPwEzHnZuycGlcjz_S5ofGLXynxKC0cPHP-uM/edit?usp=sharing
PDF - https://drive.google.com/file/d/1bsIjjciJMHLjagopBX-YJN8eoJaVITsr/view?usp=share_link
Laptop dataset Uncleaned: https://www.kaggle.com/datasets/ehtishamsadiq/uncleaned-laptop-price-dataset
"Error Code: 1093 Resolution": https://docs.google.com/document/d/1-z5GmHsSpRWBa2_hvswMxDUO4f-ozsPTG-4mtyExhk8/edit?usp=sharing
 
1:22:00 : There are duplicate data in the datasets.

Session on EDA using SQL | Laptops Dataset¶

Description

Code Data Cleaning - https://docs.google.com/document/d/1_urkFSBPwEzHnZuycGlcjz_S5ofGLXynxKC0cPHP-uM/edit?usp=sharing
Code EDA - https://docs.google.com/document/d/1Izh0o3ZTsVcSw5ZHsX5uB7v7IGxJ7hbX7a3VfIuFv1c/edit?usp=sharing
Laptop dataset Uncleaned: https://www.kaggle.com/datasets/ehtishamsadiq/uncleaned-laptop-price-dataset

"Error Code: 1093 Resolution": https://docs.google.com/document/d/1-z5GmHsSpRWBa2_hvswMxDUO4f-ozsPTG-4mtyExhk8/edit?usp=sharing
 
EDA Plan

1. head -> tail -> sample

2. for numerical cols

    - 8 number summary[count,min,max,mean,std,q1,q2,q3]

    - missing values

    - outliers

    -> horizontal/vertical histograms

3. for categorical cols

    - value counts -> pie chart

    - missing value

4. numerical - numerical

    - side by side 8 number analysis--

    - scatterplot

    - correlation

5. categorical-categorical

    - contigency table -> stacked bar chart

6. numerical-categorical

    -> compare distribution across categories

8. missing value treatment

9. feature engineering

- ppi

- price_bracket

10. one hot encoding

Week 15 - SQL Continued¶

Session 34 - SQL Joins¶

Description

Dataset - https://drive.google.com/drive/folders/1RAo7rKbbnzt5Zrek1sq1pb3OJ7-IbQ5f?usp=share_link
Animation - https://infytq.onwingspan.com/web/en/app/toc/lex_auth_01275806667282022456_shared/overview

Task 34¶

Assignment Link

Task 34 Solutions¶

Description

Code - https://docs.google.com/document/d/1Qic8ZU2Ek5a2PBQ52-P4VQzoh4aCl3Zpmc1484ZTzsE/edit?usp=sharing

docs.google.com

SQL Case Study 1 | Zomato Dataset¶

Description

Queries - https://docs.google.com/document/d/1H0DjShcScLvrKde5ePTKQ11AM1y1mr_IS4B1Yxl9UUQ/edit?usp=sharing
Data - https://docs.google.com/spreadsheets/d/1JgNHxTixDA50W1l6pNFmHKRaX1a9QnXrpGLsJtzo6Gg/edit?usp=sharing

Session 35 - Subqueries in SQL¶

Description

Dataset Link - https://drive.google.com/drive/folders/1xCNbO_LJIkr7bi9YDa7hUFYgJ-IZ01A-?usp=share_link

For reading movies.csv in Python : 
df = pd.read_csv('movies.csv', delimiter=';', encoding_errors='ignore')

drive.google.com

Task 35¶

Assignment Link

Task 35 Solutions¶

Description

Code - https://docs.google.com/document/d/1DI1obZn5y-wa5KIVAJIePtFssbtMfwjcEkkB1xKNH78/edit?usp=sharing

docs.google.com

Making a Flights Dashboard using Python and SQL¶

Description

Code - https://github.com/campusx-official/flights-sql-app
Dataset - https://docs.google.com/spreadsheets/d/1xuKHmRuCiCXIa1m2f7cX8PqLTTQXVtV1dky0zgnrSlU/edit?usp=sharing
AWS upload - https://colab.research.google.com/drive/1qBH_ZfTanr4N9QPQHfH0k9lbHpRa6EMx?usp=sharing

SQL Interview Questions Part 1¶

Description

PDF - https://drive.google.com/file/d/1hz6Sijm-RWQ8g_mSAlllxrcjMbE7UG8V/view?usp=share_link
SQL Joins - https://learnsql.com/blog/sql-joins/
 

Week 14 - SQL Continued¶

Session 32 - SQL DML Commands¶

Description

SQL Operators: https://drive.google.com/file/d/13Cu0VUqbENcUX76bx_aoBS0EAtzDPDrE/view
Dataset : https://drive.google.com/drive/folders/1rN2AkOfuJEhroHqRuECshxBKhz0gJBeJ

Task 32¶

Assignment Link

Task 32 Solutions¶

Description

Solution - https://docs.google.com/document/d/1vYRCL6rKlUi07hlNPZH2I48om65XZKuLl3h-6APfbFM/edit?usp=sharing

docs.google.com

Session 33 - SQL Grouping + Sorting¶

Description

Dataset - https://drive.google.com/file/d/1FQ_rZm_fvak-rVd85jlWLe1XFT_BpcMZ/view?usp=share_link

drive.google.com

Task 33¶

Assignment Link

Task 33 Solutions¶

Description

Solutions - https://docs.google.com/document/d/1gRhJp2OY3UqHQ3Cs5812zddHM3zxa5gerERS2ZoboZU/edit?usp=sharing

docs.google.com

Career QnA¶

Description

Code - https://www.kaggle.com/datasets/benroshan/ecommerce-data

www.kaggle.com

Session 2 on Tableau - Sales Dataset¶

Description

Dataset - https://www.kaggle.com/datasets/benroshan/ecommerce-data

www.kaggle.com

Week 13 - SQL Basics¶

Session 30 - Database Fundamentals¶

Description

https://dataschool.com/data-modeling-101/row-vs-column-oriented-databases/
https://www.crio.do/blog/what-is-redis/
Handwritten Note used in this session: drive.google.com/file/d/1mSKBgj5OoQrEbGoV1cSabaHtNNLZIf_n/view?usp=share_link

Session 31 - SQL DDL Commands¶

Description

Notebook pdf : https://drive.google.com/file/d/146GPq4K7135qOVwggYZ4zjbWhX8J0p8W/view

drive.google.com

Session 1 on Tableau - Olympics Dataset¶

Description

Data - https://drive.google.com/drive/folders/1aq6Xmi3KQ6uQfejOSVGl1IMRwd9OQDHc?usp=share_link

drive.google.com

Week 12 - Data Analysis Process Contd.¶

Session on Data Cleaning Case Study - Smartphone dataset¶

Description

Code - https://colab.research.google.com/drive/1TGYxt3X2YN7SlfocQg_-6A9pakp-WXZX?usp=sharing
Dataset - https://docs.google.com/spreadsheets/d/1oBG0ZtYiWzehWa1K6pV8huMtVEJxCY4C9vPaGCt1_gU/edit?usp=sharing

colab.research.google.com{ target="blank" title="https://colab.research.google.com/drive/1TGYxt3X2YN7SlfocQg-6A9pakp-WXZX?usp=sharing" }
docs.google.com

Session 29 - Exploratory Data Analysis (Titanic Dataset)¶

Description

Notebook Link: https://colab.research.google.com/drive/13rFqQJqU5RgxSdtUARZAUrzAoweE3rbQ?usp=sharing
---------------------------------------------------------------------------------------------------------------------------------------- Dataset Link : https://drive.google.com/drive/folders/1oFZxHRuAw_JI7soe46mmO61s-WM7jtQg?usp=share_link

Task 29¶

Assignment Link

Session on Data Cleaning Part 2¶

Description

Code - https://colab.research.google.com/drive/1TGYxt3X2YN7SlfocQg_-6A9pakp-WXZX?usp=sharing

colab.research.google.com{ target="blank" title="https://colab.research.google.com/drive/1TGYxt3X2YN7SlfocQg-6A9pakp-WXZX?usp=sharing" }

Session on EDA Case Study - Smartphones Dataset¶

Description

EDA code - https://colab.research.google.com/drive/1CLkCDQAFZfNmLO0MRf14bTKbnV-1NtMs?usp=sharing
Data Cleaning round 1 code - https://colab.research.google.com/drive/1TGYxt3X2YN7SlfocQg_-6A9pakp-WXZX?usp=sharing
Data Cleaning round 2 code - https://colab.research.google.com/drive/1E7nUdvyKpm6C-4oIw67rV6EufLEeCTrx?usp=sharing
Dataset (v3 & v5): https://drive.google.com/drive/folders/1xujCj-9CAwtenyPdtU07bMPXomarOSoV?usp=share_link

colab.research.google.com
colab.research.google.com
colab.research.google.com{ target="blank" title="https://colab.research.google.com/drive/1TGYxt3X2YN7SlfocQg-6A9pakp-WXZX?usp=sharing" }
drive.google.com

Week 11 - Data Analysis Process¶

Session 27 - Data Gathering | Data Analysis Process¶

Description

Codes:
https://github.com/campusx-official/100-days-of-machine-learning/tree/main/day15%20-%20working%20with%20csv%20files
https://github.com/campusx-official/100-days-of-machine-learning/tree/main/day16%20-%20working-with-json-and-sql
https://github.com/campusx-official/100-days-of-machine-learning/tree/main/day17-api-to-dataframe
https://github.com/campusx-official/100-days-of-machine-learning/tree/main/day18-pandas-dataframe-using-web-scraping
https://github.com/campusx-official/pandas-io
 
Update:

@55:45 : Below screenshot is or similar example. There is a mistake in the video, instead of chunks
in the for loop, it is chunk.

Task 27¶

Assignment Link

Task 27 Solutions¶

Description

Code - https://colab.research.google.com/drive/1rTHjrE289QWSHD_I8UL_DAw_WzYMS5bh?usp=sharing

colab.research.google.com

Session 28 - Data Assessing and Cleaning¶

Description

Code - https://colab.research.google.com/drive/1ca-jlBvJ4uqpbCHFFgCFp9akIY7FSmGc?usp=sharing
Dataset - https://github.com/campusx-official/data-wrangling
 
For error at 2:03:00 : 'float' type is not subscriptable while extracting Phone number and email, use below code.
# For Phone Number
patients_df["contact"].apply(lambda x: find_contact_details(x)).apply(lambda x:'No data' if type(x[0])==float else x[0][-1])
# For Email:
patients_df["contact"].apply(lambda x: find_contact_details(x)).apply(lambda x:x[1])

Task 28¶

Assignment Link

Session on ETL using AWS RDS¶

Description

Code - https://colab.research.google.com/drive/1qBH_ZfTanr4N9QPQHfH0k9lbHpRa6EMx?usp=sharing
Dataset - https://www.kaggle.com/datasets/patrickb1912/ipl-complete-dataset-20082020
https://www.kaggle.com/datasets/harsha547/indian-premier-league-csv-dataset
Dream11 - https://www.dream11.com/games/point-system

Session on Advanced Web Scraping using Selenium¶

Description

Code - https://github.com/campusx-official/advanced-web-scraping
Chrome Driver - https://chromedriver.chromium.org/downloads
Selenium docs - https://selenium-python.readthedocs.io/

Week 10 - Data Visualization Continued¶

Session 25 - Plotting using Seaborn¶

Description

Code - https://colab.research.google.com/drive/1_Mk2NWYBzNxICokEtJFxsf6haq_1YIDA?usp=sharing

colab.research.google.com

Task 25¶

Assignment Link

Task 25 Solutions¶

Description

Code - https://colab.research.google.com/drive/1uWh7VoNYpBDzkeOhOfWiDT_fvYGxHN8B?usp=sharing

colab.research.google.com

Session 26 - Plotting using Seaborn Part 2¶

Description

Code - https://colab.research.google.com/drive/18GuhOaBBhaBJ9RtVNHRJQzNxNPPSKBrD?usp=sharing

Seaborn Theming and Color Palette: https://colab.research.google.com/drive/1FjKejgCJwUsxm_XYiRf25-jCvW4rDK64?usp=sharing

Task 26¶

Assignment Link

Task 26 Solutions¶

Description

Code - https://colab.research.google.com/drive/19MufvB3-9Owf72SZFheBwnqu0_yU_8VW?usp=sharing

colab.research.google.com

Session on Open Source Software Part 1¶

Description

Github discussions - https://resources.github.com/devops/process/planning/discussions/
Github Projects - https://docs.github.com/en/issues/planning-and-tracking-with-projects/learning-about-projects/about-projects
Github Actions - https://docs.github.com/en/actions/learn-github-actions/understanding-github-actions
 

Session on Open Source Software Part 2¶

Description

Websites
https://github.com/explore
https://www.codetriage.com/?language=Python
https://firstcontributions.github.io/#project-list
https://www.firsttimersonly.com/
https://24pullrequests.com/
https://summerofcode.withgoogle.com/archive/2022/organizations
https://hacktoberfest.com/

Week 9 - Data Visualization¶

Session 23 - Plotting using Matplotlib¶

Description

Datasets used in the session - https://drive.google.com/drive/folders/1_TyTVEMxhEoIs1nsU4V_p1rH5x4jLSiW?usp=share_link Notebook Links -https://colab.research.google.com/drive/1ksmroQtN_KoCeJzpzPgGAbLv0UG_6G_a?usp=sharing

Task 23¶

Assignment Link

Task 23 Solutions¶

Description

Code - https://colab.research.google.com/drive/1ssQKshkqJvIKnphx0JQfClJqt_vS67rb?usp=sharing

colab.research.google.com

Session 24 - Advanced Matplotlib¶

Description

Datasets used in the session - https://drive.google.com/drive/folders/17q7WRLJ7hdkA7nk8J_GUTEcZ7WHgPKUg?usp=share_link
Notebook Links - https://colab.research.google.com/drive/14TP6tNzUT5M0YfgzwTMF_6WBuQkLUgXp?usp=sharing

Task 24¶

Assignment Link

Task 24 Solutions¶

Description

Code - https://colab.research.google.com/drive/1TXxHzzrAgXCrbzp3kkqsf8OBnuQlZzI3?usp=sharing

colab.research.google.com

Session on Plotly(Express)¶

Description

Code - https://colab.research.google.com/drive/11Ya6Pi2pNoHT3yLr2fGbl-Zd6cz67x4M?usp=sharing

colab.research.google.com

Making a Corona virus(Covid-19) Dashboard using Plotly and Dash¶

Description

    You can get the code and datasets from here: https://github.com/campusx-official/campusx-official

Dataset : https://github.com/NitRookies/COVID19_Codechef/

Project using Plotly¶

Description

Datasets - https://www.kaggle.com/datasets/sirpunch/indian-census-data-with-geospatial-indexing
https://www.kaggle.com/datasets/danofer/india-census?select=india-districts-census-2011.csv
Kaggle Notebook - https://www.kaggle.com/code/campusx/notebook1f43313be3
Project Files - https://github.com/campusx-official/india-data-viz-mini-project
 
For ModuleNotFoundError in Session Indian Startup Funding at timestamp : 2:13:00 :

 -> Similar Issue been resolved in this session from Timestamp 1:54:35

Week 8 - Advanced Pandas Continued¶

Session 21 - MultiIndex Series and DataFrames¶

Description

Datasets used in the session - https://drive.google.com/drive/folders/1AP_M96SnIe985aQQp9SmDkz69AXHrs5t?usp=share_link Notebook Link - https://colab.research.google.com/drive/17l8EddlrS2Ed35frmeS6cHAvdf5Fbw-g?usp=sharing

Task 21¶

Assignment Link

Task 21 Solutions¶

Description

Code - https://colab.research.google.com/drive/1v3YVApNLyFyK3VucW08MYg6G8Lmf2yUw?usp=sharing

colab.research.google.com

Session 22 - Vectorized String Operations | DateTime in Pandas | Pivot Table¶

Description

 
Datasets used in the session - https://drive.google.com/drive/folders/1Vy1LilxgmyBiDg-UAnrBnJ5R1XBvwHGx?usp=share_link
 
Notebook Link -
Multi Index Object : https://colab.research.google.com/drive/17l8EddlrS2Ed35frmeS6cHAvdf5Fbw-g?usp=sharing
Strings : https://colab.research.google.com/drive/1IbvN3BABXN2sgxNr3EckyB_WO4tP0EpE?usp=sharing
Date Time : https://colab.research.google.com/drive/1zkfBGu48iLfJWNzAosD_qbCjzi7dYf8-?usp=sharing

Task 22¶

Assignment Link

Task 22 Solutions¶

Description

Code - https://colab.research.google.com/drive/1EV72ez6mbkzdyJvJdq3fFS0KmXAxYfGx?usp=sharing

colab.research.google.com

Pandas Case Study - Time Series Analysis¶

Description

Code - https://colab.research.google.com/drive/12G4cSJkdYAE6tLjilZ90ou_YfP1xXij8?usp=sharing
Datasets - https://drive.google.com/drive/folders/15WZn-YqSRbEAMM3ErlDdm3zDu6m47tUb?usp=share_link

Pandas Case Study 2 - Working with textual data¶

Description

Code - https://www.kaggle.com/campusx/pandas-on-nlp-data
Dataset - https://www.kaggle.com/datasets/lakshmi25npathi/imdb-dataset-of-50k-movie-reviews

Week 7 - Advanced Pandas¶

Session 19 - GroupBy Object in Pandas¶

Description

 
Datasets used in the session - https://drive.google.com/drive/folders/1IiMIOGCv-giUV_rtF02sImgkaVuAQxkz?usp=share_link
Notebook Link - https://colab.research.google.com/drive/1JZwTCZp2kbiTACzcuXmWq_FL-GRNo9ym?usp=sharing

Task 19¶

Assignment Link

Task 19 Solutions¶

Description

Code - https://colab.research.google.com/drive/157g2UZM9StxNMz-dUw2uPH-JeKn6z269?usp=sharing
IPL Delevries Dataset (for Q5 - to Q8, ) : https://docs.google.com/spreadsheets/d/1ROM5oTEEMXfnBHAmz3XC5lwGMgnQaw80PiXadKHRgGU/edit?usp=sharing

Session 20 - Merging, Joining & Concatenating¶

Description

Datasets used in the session - https://drive.google.com/drive/folders/1tE0LxbzsVX70y8Br_VxiZas288ODDBup?usp=share_link
Notebook Link - https://colab.research.google.com/drive/1Xs7On5fr6ZZnrwGxgMXWld2XUrniqXGj?usp=sharing

Task 20¶

Assignment Link

Task 20 Solutions¶

Description

Code - https://colab.research.google.com/drive/1aLK7SUm8v2rRhJSU-wXyWERFMcBWC1mK?usp=sharing

colab.research.google.com

Session on Streamlit¶

Description

Code - https://github.com/campusx-official/streamlit-basics
Learn LaTeX - https://www.overleaf.com/learn/latex/Learn_LaTeX_in_30_minutes#What_is_LaTeX.3F
Learn Markdown - https://www.markdownguide.org/basic-syntax/#images-1
Streamlit docs - https://docs.streamlit.io/library/api-reference
Dataset link - https://www.kaggle.com/datasets/sudalairajkumar/indian-startup-funding
Plan of Action - https://docs.google.com/document/d/1zk4751zmG2b4XnYGW06tu0MWyr2PgLlMaSci7eUVL2M/edit?usp=sharing

Pandas Case Study - Indian Startup Funding¶

Description

Code - https://github.com/campusx-official/streamlit-basics
Kaggle code - https://www.kaggle.com/campusx/startup-data-analysis
Plan of Action - https://docs.google.com/document/d/1zk4751zmG2b4XnYGW06tu0MWyr2PgLlMaSci7eUVL2M/edit?usp=sharing
Dataset - https://www.kaggle.com/datasets/sudalairajkumar/indian-startup-funding
 
Timestamp -> 0:40:50 
# Converting date column datatype
df['date'] = df['date'].replace({'05/072018':'05/07/2018', '01/07/015':'01/07/2015', '22/01//2015':'22/01/2015'})
df['date'] = pd.to_datetime(df['date'])
For Issue at time 2:03:58 :

Use session state for option:

# Like below, the rest codes are the same as of sir's GitHub repo.
st.session_state.option = st.sidebar.selectbox(
    'Select One', ['Overall Analysis', 'StartUp', 'Investor'], key='analysis')
option = st.session_state.option
if option == 'Overall Analysis':
    load_overall_analysis()
 
For ModuleNotFoundError at 2:13:00 :

 -> Similar Issue been resolved in Week 9: Project Using Plotly session from Timestamp 1:54:35
 

Session on Git¶

Description

Download Git - https://git-scm.com/download/win
What is GIT

What is VCS/SCM

Examples of VCS

Why git/VCS is needed

Types of VCS

- Centralized

- Distributed

Advantages

- Version control

- Bug Fixing

- doing non-linear development

- collaborative development

************************************

How git works? -> terminology

installing git

************************************

Creating a repo

cloning someone else's repo

status

************************************

Making Changes

- add

- commit

- When to commit?

- commit messages?

** short

** Explain what

** rule of thumb no and

** This commit will ...

- add .

- gitignore

***********************************

Seeing commits

- log -> oneline -> stat -> p

- show

- seeing commits of someone else's repo

- diff

**********************************

Creating versions of a software

- tag->X.Y.Z

X – The major version, used for making major and backward-incompatible changes.

Y – The minor version, used for adding functionality while maintaining backwards compatibility.

Z – The patch version, used for making small bug fixes while maintaining backwards compatibility.

- deleting tag

- adding tag to a past commit

**********************************

GIT PDF : https://drive.google.com/file/d/1jmialN0Jhhuj5fl2K1N7R9LosoIjrtDb/view?usp=sharing

Session on Git and Github Part 2¶

Description

******************************************************

Non linear development(Branching)

******************************************************

-> Scenario(Individual)

-> Scenario(Team)

-> Using branches -> You had one branch already

-> concept of head pointer

HEAD is the reference to the most recent commit in the current branch. This means HEAD is just like a pointer that keeps track of the latest commit in your current branch.

-> Creating branches on head

-> Creating branches on past commits

-> Show all branches -> Active Branch

-> switch between branches->How this works?

-> Understanding what will come under a branch(git log)

-> Making new commits in all branches(git log)

-> see all branches at once -> --graph --all

-> deleting branches

******************************************************

Merging Branches

******************************************************

-> What is merging

-> What happens at merging

** A new commit is created on merging

** look at the branches that it's going to merge

** look back along the branch's history to find a single commit that both branches have in their commit history

** combine the lines of code that were changed on the separate branches together

** makes a commit to record the merge

** Note - Merging happens at the checked out branch. No new branches are created

-> Types of merging -> Fast Forward -> Regular(Divergent branches)

-> Fast Forward -> show log

-> Merging Divergent Branches -> show log

-> Merge Conflict

(<<<<<<< HEAD) everything below this line (until the next indicator) is code of current branch

(=======) is the end of the original lines, everything that follows (until the next indicator) is what's on the branch that's being merged in

(>>>>>>> heading-update) is the ending indicator of what's on the branch that's being merged in

-> Resolving Conflicts

****************************************************************************************************

Undoing Changes

****************************************************************************************************

-> editing the last commit message

-> forgot to add some files to the last commit

-> rolling back to a specific state using show

-> revert a commit

*****************************************************************************************************

Working with a remote repo

******************************************************************************************************

-> Need -> scenario-> collaboration

-> The flow diagram

-> create a new repo on github

-> add remote(git remote add origin )

-> push code(git push  )

-> git log -> tracking branch

-> add a readme file 

-> pull code
GIT PDF : https://drive.google.com/file/d/1jmialN0Jhhuj5fl2K1N7R9LosoIjrtDb/view?usp=sharing

drive.google.com

Week 6 - Pandas¶

Session 16 - Pandas Series¶

Description

Code - https://colab.research.google.com/drive/1Te483lmZDDKzzU0YFnuHiSy87AHquJKF?usp=sharing
Download Datasets - https://drive.google.com/drive/folders/1aUJ85Ea-TxVWSQNvtKjkz75fqEAqFHk6?usp=share_link

Important Series Methods | Supplementary Session¶

Description

Code - https://colab.research.google.com/drive/1Te483lmZDDKzzU0YFnuHiSy87AHquJKF?usp=sharing
Download Datasets - https://drive.google.com/drive/folders/1aUJ85Ea-TxVWSQNvtKjkz75fqEAqFHk6?usp=share_link

Task 16¶

Assignment Link

Session 17 - Pandas DataFrame¶

Description

Dataset - https://drive.google.com/drive/folders/18ZsL5K8PiFT9GS1631lpGIE3HMR0n1G8
Code - https://colab.research.google.com/drive/1k_CV931NE4_jMxmu3k2ASPMmIA4g5Wcn?usp=sharing

Task 17¶

Assignment Link

Session 18 - Important DataFrame Methods¶

Description

Datasets - https://drive.google.com/drive/folders/18ZsL5K8PiFT9GS1631lpGIE3HMR0n1G8?usp=share_link
Code - https://colab.research.google.com/drive/1a5Yii5DmHtaNH2QyMInegixa26pLWHo7?usp=sharing

Task 18¶

Assignment Link

Session on API Development Using Flask¶

Description

Code - https://github.com/campusx-official/ipl-api-service

github.com

Week 6 - Numpy Interview Questions¶

Description

Code - https://github.com/campusx-official/ipl-web-app
Code - https://colab.research.google.com/drive/1_QkzCawz5aofyLJXbGXF-vNsTAxjnNgG?usp=sharing

Task 16 Solutions¶

Description

Code - https://colab.research.google.com/drive/16F_MMON9J1fFDYrxtyYaQDP80SQjj8x0?usp=sharing

colab.research.google.com

Task 17 Solutions¶

Description

Code - https://colab.research.google.com/drive/1GrYEtG87mn5-_MH7rWPTZJJxJ8jl5mWI?usp=sharing

colab.research.google.com

Task 18 Solutions¶

Description

Code - https://colab.research.google.com/drive/1NcOunueBaiVEkj2Y4DcGyf2jBIHlE9QQ?usp=sharing
Question No. 6 Solution:
Modification in notebook: While calculation home_win and away_win, use bitwise AND operator-(&). In solution bitwise OR is given.
home_win = df[(df.WinningTeam == team) & (df.Team1 == team)].shape[0] / df[df.Team1 == team].shape[0] * 100
away_win = df[(df.WinningTeam == team) & (df.Team2 == team)].shape[0] / df[df.Team2 == team].shape[0] * 100

colab.research.google.com

Week 5 - Numpy¶

Session 13 - Numpy Fundamentals¶

Description

Code - https://colab.research.google.com/drive/1GEkNfxnCPfzX8TCymkZJvWbkpGKZRgiK?usp=sharing

colab.research.google.com

Task 13¶

Assignment Link

Session 14 - Advanced Numpy¶

Description

Code - https://colab.research.google.com/drive/1RVe07-2VU4Jft8GLFyf10PrQIOVfffaR?usp=sharing

colab.research.google.com

Task 14¶

Assignment Link

Session 15 - Numpy Tricks¶

Description

Code - https://colab.research.google.com/drive/1PUW-yXxbkSYgvlhC5_zf30aOVyKYo9AC?usp=sharing

colab.research.google.com

Task 15¶

Assignment Link

Session on Web Development using Flask¶

Description

Code - https://github.com/campusx-official/nlp-web-app
HTML Playlist - https://www.youtube.com/watch?v=jp3gE2Ow6Fw&list=PLKnIA16_RmvaPjreiKXncoLCLQKE0I_9D&ab_channel=CampusX
CSS Playlist - https://www.youtube.com/watch?v=4d79CMy5-LI&list=PLKnIA16_RmvYz9J-59mtVWLQuPbsWd56P&ab_channel=CampusX
Fundamentals of Web Development - https://www.youtube.com/watch?v=XEq5gEhqPNE&list=PLKnIA16_RmvaAtO498fZOVVomyx01yZhx&ab_channel=CampusX

Task 13 Solutions¶

Description

Code - https://colab.research.google.com/drive/15sGD09W5CVrYxuayY_81vDXlv6a7f-PB?usp=sharing

colab.research.google.com

Task 14 Solutions¶

Description

Code - https://colab.research.google.com/drive/1UFnFdlSnm2wGcLFKE8mOifgcpfSICoKJ?usp=sharing

colab.research.google.com

Task15 Solutions¶

Description

Code - https://colab.research.google.com/drive/1tesod4lk55ZgC2a4xB0fUhUAD0nz1cTT?usp=sharing

colab.research.google.com

Week 4 - Advanced Python¶

Session 10 - File Handling + Serialization & Deserialization¶

Description

Code https://colab.research.google.com/drive/1TP7ks1pnEzJwwzHtswkSYvMWwo2HeRxM?usp=sharing
at Timestamp : 54:00 Reading a big text file
with open('big.txt', 'r') as f:

    chunk_size = 10

    data = f.read(chunk_size)

    while len(data) > 0:

        print(data, end='****')

        data = f.read(chunk_size)

colab.research.google.com

Task 10¶

Assignment Link

Session 11 - Exception Handling¶

Description

Code - https://colab.research.google.com/drive/1-yYl5wagPH1ctS_x-RBDyMFj-ez8hWvS?usp=sharing

colab.research.google.com

Task 11¶

Assignment Link

Session 12 - Decorators & Namespaces¶

Description

Code - https://colab.research.google.com/drive/1P5jtGzaVkIjEFFr6WSrzs0capal32QPn?usp=sharing

colab.research.google.com

Supplementary Session on Iterators¶

Description

Code - https://github.com/campusx-official/python-iterators-and-iterables

github.com

Supplementary Session on Generators¶

Description

Code - https://github.com/campusx-official/python-generators

github.com

Task 12¶

Assignment Link

Session on Resume Building¶

Description

PPT - https://docs.google.com/presentation/d/1hpsmCqwk6cxqulpDmPvnbYdYxzW83msvofIGn77Bczo/edit?usp=sharing
Blog Link - https://zety.com/blog/data-scientist-resume-example

Session on GUI Development using Python [2nd Dec - Fri]¶

Description

Code - https://github.com/campusx-official/nlpapp/tree/master

github.com

Week 4 - Interview Questions¶

Description

Code - https://colab.research.google.com/drive/19YHAGgm5856CuJyGmtO5kpewmXJzbUcK?usp=sharing

colab.research.google.com

Task 10 Solutions¶

Description

Code - https://drive.google.com/file/d/1vz4rlbZS1y9onWq6S3YQ6a0MbyTS0-3y/view?usp=sharing

drive.google.com

Task 11 Solutions¶

Description

Code - https://colab.research.google.com/drive/1DhvCxLlAduDxP3k7SVfwsmFL9ZKbDPsU

colab.research.google.com

Task 12 Solutions¶

Description

Code - https://colab.research.google.com/drive/1IHLZkKHJY6YBTB2aOVtfT5WVV_KJ9pqV?usp=sharing

colab.research.google.com

Week 3 - Object Oriented Programming(OOP)¶

Session 7 - OOP Part 1 | Class & Object¶

Description

Code - https://colab.research.google.com/drive/1bzZ5WiHXcnsxZThKEefsM8HOuBPwCany?usp=sharing

colab.research.google.com

Task 7¶

Assignment Link

Task 7 Solutions¶

Description

Code - https://colab.research.google.com/drive/1r_jeTn2XoKLaWq70QYhtLCNnr2FYCaYr?usp=sharing

colab.research.google.com

Session 8 - OOP Part 2 | Encapsulation & Static Keyword¶

Description

Code - https://colab.research.google.com/drive/1F3Y_zoZH0BDdvcFwrHS46YXj8CCNDRqn?usp=sharing

colab.research.google.com

Task 8¶

Assignment Link

task-8-solutions¶

Description

Code - https://colab.research.google.com/drive/1Ia10uit7iguGlWjwG7gMF7h3AwNRL2KX?usp=sharing

colab.research.google.com

Session 9 - OOP Part 3 | Inheritance & Polymorphism¶

Description

Code - https://colab.research.google.com/drive/1_Pd9W1yltcfPNGPj6oDqIM5m7sadegcY?usp=sharing

colab.research.google.com

What is Abstraction | OOP Concept¶

Description

Code - https://colab.research.google.com/drive/1_Pd9W1yltcfPNGPj6oDqIM5m7sadegcY?usp=sharing

colab.research.google.com

Task 9¶

Assignment Link

task-9-solutions¶

Description

Code - https://colab.research.google.com/drive/1knllPL_sSnfqTrrqh0Ru9VpqLbGvPNjS?usp=sharing

colab.research.google.com

Session on OOP Project¶

Description

Code - https://colab.research.google.com/drive/1WyFLxv8gs5nK1YeAx7oO3IU7qev8PYDP?usp=sharing

colab.research.google.com

Week 3 - Interview Questions¶

Description

Interview Questions class discussion -
https://colab.research.google.com/drive/1pFSCaenXUtrWRPgP4zTOcM_2GQIMjU4z?usp=sharing
More Interview Questions -
https://colab.research.google.com/drive/1LlTdY0LeYdI893EtSN3CqZOFg_bFxrPu?usp=sharing

Week 2 - Python Data Types¶

Session 4 - Lists in Python¶

Description

Code - https://colab.research.google.com/drive/1VUb_lXKIcypsWkKiMJ_NDwX2msdgEVGH?usp=sharing

colab.research.google.com

Task 4¶

Assignment Link

Task 4 Solutions¶

Description

Code - https://colab.research.google.com/drive/1uBqC9zOZH3e26WWc-R4dplohg2uvR-Xs?usp=sharing
Problem 14 : 
print([[row[i] for row in matrix]for i in range(len(matrix))]) # Only works for Square Matrix.
# Updated code 

print([[row[i] for row in matrix]for i in range(len(matrix[0]))])

colab.research.google.com

Session 5 - Tuples + Sets + Dictionary¶

Description

Code - https://colab.research.google.com/drive/1PtRoTO1A4HwZudf0ZqCq-sqiDpamVeyH?usp=sharing

colab.research.google.com

Task 5¶

Assignment Link

Task 5 Solutions¶

Description

Code - https://colab.research.google.com/drive/17h1BUmK2qb9YZCCZKhUXsf6hRJuO42_t?usp=sharing

colab.research.google.com

Session 6 - Functions in Python¶

Description

Code - https://colab.research.google.com/drive/1DfnVcPYRGJDDrKcpYmEAuPHAR7XeTaKH?usp=sharing

colab.research.google.com

Task 6¶

Assignment Link

Task 6 Solutions¶

Description

Code - https://colab.research.google.com/drive/1r5De6VqloVnqqR94x2jz23JS2q9e2mQ2?usp=sharing

colab.research.google.com

Session on Array Interview Questions¶

Description

Code - https://colab.research.google.com/drive/1xUoy5AW_vlI92xbIcfEbnx0ZnGb7IKbj?usp=sharing
Time Stamp: 40:00. Q10 Maximum Sum SubArray.

Getting the best sum but Array printed is not correct.

This is happening because of list referencing. Say we have a list a = [1,2,3] and another list b which is same as a like a = b, so if we make changes in a, b will also change. But if we assign b like:  b = a[:] This time upon changing a, b will not change.
Correction in approach 1 : 

d[sum(subarray)] = subarray[:] # Cloning will solve this.

Correction Correction in Approach 2:

best_seq = curr_seq[:]

colab.research.google.com

Week 2 - Interview Questions¶

Description

Question List - https://colab.research.google.com/drive/198UqZ59bcyluNKgIyw9K20uToSaHSjT5?usp=sharing

colab.research.google.com

Week 1 - Basics of Python Programming¶

Session 1 - Python Basics¶

Description

Code used in the session - https://colab.research.google.com/drive/10jVbuKq2Owsz_hIIrXA9Y09DMLdDt_21?usp=sharing

colab.research.google.com

Task 1¶

Assignment Link

Session 2 - Operators + If-Else + Loops¶

Description

Code for the session - https://colab.research.google.com/drive/1dJIncqudN2wFNZ1P3_1sdJX76pzw-s-4?usp=sharing
Session Code (Updated) : https://colab.research.google.com/drive/1He-CC_4GUaswgQ2NFg8exc23A-mChTwK?usp=sharing

Task 2¶

Assignment Link

Week 1 - Task 1 + Task 2 Solutions¶

Description

Code for Task 1 Solution - https://colab.research.google.com/drive/15ouziM6EkwvOYIJM_Z4kX9AUJ6TmmZnh?usp=sharing
Code for Task 2 Solution - (Updated Link) - https://colab.research.google.com/drive/1mkBnQb0IELTQCpQ9nhtEYhqOgG84DTVd?usp=share_link

Session 3 - Python Strings¶

Description

Code - https://colab.research.google.com/drive/1l1TCiGQM_fyRLX-kM2HZhKdrNY7aZHKA?usp=sharing

colab.research.google.com

Programming Problems on Strings¶

Description

Code - https://colab.research.google.com/drive/1l1TCiGQM_fyRLX-kM2HZhKdrNY7aZHKA?usp=sharing

colab.research.google.com

Task 3¶

Assignment Link

Week 1 Task 3 Solutions¶

Description

Code - https://colab.research.google.com/drive/1HLKrP34x9ypGXqN44XcAvlwuEmWZ7idg?usp=sharing

colab.research.google.com

How to Build a Portfolio Website for Data Science¶

Description

Portfolio Website Example - https://www.kunalgohrani.info/
HTML playlist - https://www.youtube.com/watch?v=jp3gE2Ow6Fw&list=PLKnIA16_RmvaPjreiKXncoLCLQKE0I_9D&index=1&t=0s&ab_channel=CampusX
CSS Playlist - https://www.youtube.com/watch?v=4d79CMy5-LI&list=PLKnIA16_RmvYz9J-59mtVWLQuPbsWd56P&index=1&t=0s&ab_channel=CampusX

Session on Time Complexity¶

Description

PPT - https://docs.google.com/presentation/d/1oz_Uq3EvDzSJTi7uDUDSGDxRvXHxtNE_6fxvAYeGUnY/edit?usp=sharing
Code - https://colab.research.google.com/drive/1tZn_tsleyYYh-_fOB3fDpaSIoNg4BmJI?usp=sharing

Week 1 - Interview Questions¶

Description

Question List - https://colab.research.google.com/drive/1Qh4SPnaqWXC3tA3EwzcV6BSKOEM0e9E4?usp=sharing

colab.research.google.com

Career Pe Charcha¶

Session on Open Source Software¶

Description

    Github Discussions - https://resources.github.com/devops/process/planning/discussions/

Github Actions - https://docs.github.com/en/actions/learn-github-actions/understanding-github-actions

Github Projects - https://docs.github.com/en/issues/planning-and-tracking-with-projects/learning-about-projects/about-projects