This repository contains all the assignments and solutions for the Applied Data Science course by Amir Hesam Salavati, including tasks related to data manipulation, visualization, machine learning, and deep learning. The materials of the course are accessible through this link.
- Familiarization with Python tools for data science, including Colab and Kaggle.
- Practical tasks using datasets to explore the fundamentals of data manipulation.
-
Techniques for analyzing datasets, handling missing values, and preparing data for modeling.
-
Focus on converting data into usable numerical formats and normalizing values.
-
Practice with a variety of visualization techniques (e.g., pie charts, scatter plots, interactive charts) to uncover insights.
- Creation of new features using methods like binning, aggregation, and dimensionality reduction (e.g., PCA).
- Evaluation of regression and classification models using metrics such as MAE, MSE, Precision, Recall, and F1-Score.
-
Implementation of linear regression, logistic regression, ridge regression, LASSO, and kernel regression for predictive analysis.
- Exploration of binary and multiclass classification models, including SVMs, KNN, decision trees, and boosting techniques (e.g., XGBoost, LightGBM).
- Building and training deep learning models using frameworks like Keras and PyTorch.
- Tasks include binary classification, regression, and time-series analysis.
-
Application of convolutional neural networks (CNNs) for image classification, transfer learning, and data augmentation.
- Implementation of autoencoders, GANs, and tools like Grad-CAM, SHAP, and LIME to interpret model predictions.
-
Techniques to handle imbalanced datasets, including SMOTE, cost-sensitive training, and automation tools like TPOT and AutoSklearn.
- Autoencoders, GANs, and tools like Grad-CAM, SHAP, and LIME to interpret model predictions.
- Techniques to handle imbalanced datasets, including SMOTE, cost-sensitive training, and automation tools like TPOT and AutoSklearn.
Feel free to submit issues or pull requests if you find bugs or want to improve this repository.
This work was completed as part of the Applied Data Science course taught at Tehran Institute for Advanced Studies (TeIAS), Khatam University.
This project is open-source and available under the MIT License.