πΉ General Steps to Run Programs in Anaconda / Jupyter Notebook
- Install Anaconda (from anaconda.com/download).
- Open Anaconda Navigator.
- Click Launch under Jupyter Notebook.
- Your browser will open at
http://localhost:8888/tree.
- Click New β Python 3 (ipykernel) to create a new notebook.
- Write or paste your Python code in a cell.
- Press Shift + Enter to execute the cell.
- Save your work: File β Save and Checkpoint.
π‘ Tip: Always store your datasets (.csv, .xlsx) in the same folder as your
notebook for easy access.
π§© Problem Set 1 β Control Structures & Data Import/Export
- Use loops (for/while) and if-else statements for logic
building.
- Import data with
import pandas as pd β
pd.read_csv('filename.csv').
- Clean and process data (drop missing values, rename columns, etc.).
- Export using
df.to_csv('output.csv').
π€ Problem Set 2 β KNN Classification (e.g., Iris Dataset)
- Import:
from sklearn.neighbors import KNeighborsClassifier.
- Load data:
from sklearn.datasets import load_iris.
- Split dataset using
train_test_split.
- Fit model:
knn.fit(X_train, y_train).
- Predict:
y_pred = knn.predict(X_test).
- Evaluate accuracy using
accuracy_score().
π§ Problem Set 3 β NaΓ―ve Bayes Classifier
- Import:
from sklearn.naive_bayes import GaussianNB.
- Prepare features and labels (X, y).
- Train the model:
model.fit(X_train, y_train).
- Predict using
model.predict(X_test).
- Evaluate using accuracy, precision, recall, and F1-score.
π Problem Set 4 β Apriori Algorithm (Market Basket Analysis)
- Install library:
pip install mlxtend.
- Import:
from mlxtend.frequent_patterns import apriori, association_rules.
- Load dataset (e.g., Online Retail Dataset).
- Convert transactions into 0/1 format using one-hot encoding.
- Apply
apriori(df, min_support=0.05, use_colnames=True).
- Generate rules using
association_rules().
π Problem Set 5 β K-Means Clustering
- Import:
from sklearn.cluster import KMeans.
- Load a dataset (e.g., Iris, Customer, or Fruit dataset).
- Preprocess the data (scaling if needed).
- Choose number of clusters (k) using the elbow method.
- Apply clustering:
kmeans = KMeans(n_clusters=k).
- Visualize using
matplotlib.pyplot scatter plots.
π Problem Set 6 β Model Comparison and Evaluation
- Use datasets like Iris, Fruits, or Online
Retail.
- Compare models: Logistic Regression, SVM, and KNN using accuracy scores.
- Visualize results with bar or line charts.
β
Required Libraries:
pandas,
numpy,
matplotlib,
scikit-learn,
mlxtend
Install missing ones using:
pip install pandas numpy matplotlib scikit-learn mlxtend
β οΈ Note: Restart the Jupyter kernel if you install a new library during
runtime.