San Diego Wildfire Prediction & Machine Learning Coursework

This repository contains my comprehensive work for the Machine Learning course. It includes my final capstone project on wildfire prediction and a collection of weekly assignments and experiments covering various ML and DL algorithms.

1. Capstone Project: San Diego Wildfire Risk & Intensity Prediction

This project utilizes machine learning techniques to predict wildfire risks and potential fire intensity in San Diego County, based on historical meteorological data and satellite hotspot data.

Project Overview

The model consists of two prediction phases:

Wildfire Risk Prediction (Binary Classification):
- Objective: Predict whether a new wildfire will occur on a given day.
- Models: Compared Logistic Regression, XGBoost, and Neural Networks.
- Result: The Neural Network model achieved the best performance in terms of AUC-ROC and AUPRC.
Fire Intensity Prediction (Multi-class Classification):
- Objective: Classify fire intensity into "Small," "Medium," or "Large" based on Fire Radiative Power (FRP).
- Challenges: ADDressed severe class imbalance using SMOTE (Synthetic Minority Oversampling Technique).
- Result: The model performed reasonably well on small fires but faced challenges in distinguishing between medium and large intensity fires due to data limitations.

Data Sources

Fire Data: NASA FIRMS (VIIRS satellite data), including latitude, longitude, time, and FRP.
Weather Data: Visual Crossing Weather, containing hourly historical records (temperature, humidity, wind speed, etc.) for San Diego from 2020 to 2025.

2. Coursework & Experiments

This section documents the experiments conducted throughout the course, ranging from classical machine learning to advanced deep learning models.

Key Topics Covered

Regression & Classification:
- Implementation of Linear and Polynomial Regression using Gradient Descent (Batch, Stochastic, Mini-batch).
- Logistic Regression and Softmax Regression.
- Binary and Multiclass classification tasks using datasets like MNIST and Fashion-MNIST.
Support Vector Machines (SVM):
- Linear and Non-linear SVM classification and regression.
- Application of Kernel tricks (Polynomial, RBF).
Ensemble Learning:
- Implementation of Voting classifiers, Bagging, and Pasting.
- Random Forests, AdaBoost, and Gradient Boosting algorithms.
Deep Learning & Neural Networks:
- Training Deep Neural Networks (DNN) with techniques like He initialization, Batch Normalization, and Dropout.
- Optimization algorithms including Nesterov Accelerated Gradient, RMSProp, and Adam.
Sequence Models & NLP:
- Time series forecasting using RNNs, LSTMs, and GRUs (e.g., predicting Chicago transit ridership).
- Natural Language Processing tasks including sentiment analysis and text generation (Char-RNN).
- Encoder-Decoder architectures and Attention mechanisms.

Tech Stack

Language: Python 3.8
Data Processing: Pandas, NumPy
Visualization: Matplotlib, Seaborn
Machine Learning: Scikit-learn, XGBoost, Imbalanced-learn
Deep Learning: TensorFlow (Keras)

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
DL_FIRE_J2V-C2_616940		DL_FIRE_J2V-C2_616940
datasets		datasets
my_shakespeare_model		my_shakespeare_model
tiny-gpt2		tiny-gpt2
第三节		第三节
第二节		第二节
第六节		第六节
第十节+实验8+练习		第十节+实验8+练习
第四节/第四节		第四节/第四节
.gitignore		.gitignore
1-Python基础1.ipynb		1-Python基础1.ipynb
11 rnn transformer.ipynb		11 rnn transformer.ipynb
11_rnn_transformer_no_cudnn.ipynb		11_rnn_transformer_no_cudnn.ipynb
2-Python基础2.ipynb		2-Python基础2.ipynb
2classification.ipynb		2classification.ipynb
3-Python基础练习题(复习).ipynb		3-Python基础练习题(复习).ipynb
3regression.ipynb		3regression.ipynb
4-Numpy学习1.ipynb		4-Numpy学习1.ipynb
4decision_tree.ipynb		4decision_tree.ipynb
5-Numpy学习2.ipynb		5-Numpy学习2.ipynb
6-1&2周练习题(实验作业).ipynb		6-1&2周练习题(实验作业).ipynb
7boosting.ipynb		7boosting.ipynb
Housing-Copy1.ipynb		Housing-Copy1.ipynb
Housing.ipynb		Housing.ipynb
README.md		README.md
San Diego, CA, United Sta... last7days.csv		San Diego, CA, United Sta... last7days.csv
best_tree		best_tree
mlproject.ipynb		mlproject.ipynb
sd_fire_data.csv		sd_fire_data.csv
weather.csv		weather.csv
圣地亚哥县野火风险评估与强度预测.md		圣地亚哥县野火风险评估与强度预测.md
第九节+实验7+练习.ipynb		第九节+实验7+练习.ipynb
第八节+实验6+练习.ipynb		第八节+实验6+练习.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

San Diego Wildfire Prediction & Machine Learning Coursework

1. Capstone Project: San Diego Wildfire Risk & Intensity Prediction

Project Overview

Data Sources

2. Coursework & Experiments

Key Topics Covered

Tech Stack

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

San Diego Wildfire Prediction & Machine Learning Coursework

1. Capstone Project: San Diego Wildfire Risk & Intensity Prediction

Project Overview

Data Sources

2. Coursework & Experiments

Key Topics Covered

Tech Stack

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages