This repository contains our team's work across Sprints 1–4 for the forest modeling and classification challenge. The goal of this project is to develop machine learning models that predict key forest attributes — such as DBH (Diameter at Breast Height), CBH (Crown Base Height), species, genus, and plant functional type (PFT) — using a combination of Aerial LiDAR (ALS), Terrestrial LiDAR (TLS), and imputed treelists from FastFuels.
- FIA Database:
CA_TREE.csv,CA_PLOT.csv,REF_SPECIES.csv - Field Survey Data:
03_tree.csv,01_plot_identification.csv - ALS/TLS LiDAR Data: USGS
.lazfiles and.ptxTLS files - FastFuels TreeMap 2016: Treelists generated using the FastFuels API
- Geo Boundaries:
independence_lake_boundary.geojson,sedgwick_boundary.geojson
ttops.csv: ALS-derived tree tops detected from CHM and peak detectiontls_treelist.csv: TLS-derived tree-level datafftl_plots.csv: Predicted treelist from FastFuels for target plotscombined_intelimon_metrics.csv: TLS-derived structural metrics from IntELiMon
To run the notebooks, make sure you have the following Python libraries installed:
- numpy
- pandas
- matplotlib
- seaborn
- scikit-learn
- xgboost
- imblearn
- geopandas
- rasterio
- laspy
- pyproj
- folium
- plotly
- pdal
- skimage
Follow these steps to run the project successfully:
-
Clone the repository
Open a terminal and run:
https://github.com/Anshu-b/HeatMappers -
Install dependencies (different for each sprint)
pip install -r requirements.txt -
Start Jupyter
Run either:
jupyter notebook
or
jupyter lab -
Open and run the notebooks
- Notebooks are organized by sprint (see below).
- Run all cells from top to bottom.
- Outputs include classification reports, graphs, CHMs, and CSV files.
In order to run sprint 1a make sure unzip data.zip
sprint1_1-4_heatmappers.ipynbsprint1_5-6_HeatMappers.ipynb
sprint2_1-2_HeatMappers.ipynbsprint2_3-4_HeatMappers.ipynb
In order to run sprint 3 make sure unzip data.zip
sprint3_HeatMappers.ipynb
In order to run sprint 4 make sure unzip data.zip
sprint4-1-field_HeatMappers.ipynb– PFT, genus, and species classification from field datasprint4-2-tls-enhancement_HeatMappers.ipynb– Predicting CBH and CRsprint4-3-ff_HeatMappers.ipynb– Classification using FastFuels + predicted features
- Visualizations: CHMs, confusion matrices, histograms, treelist charts
- Model metrics: Accuracy, F1-score, precision, recall
- Classification reports per model/task
- CSV outputs like
ttops.csv,fftl_plots.csv - Final report summarizing all results and findings
Please see HeatMappers_project_report.pdf for:
- Model comparison (Random Forest, Logistic Regression, AdaBoost)
- PFT, genus, and species prediction performance
- Accuracy analysis of field-based vs. FastFuels-based models
- Discussion
Team Name: HeatMappers
Members:
- Ansh Bhatnagar
- Jiahe Qin
- Jeronimo Adames-Baena
- Ishayu Ghosh
- All notebooks for Sprints 1–4 are included in this repository.
- Final report is submitted as:
HeatMappers_project_report.pdf - URL to this repository is included in the Sprint 4 text submission.