I'm Osama Mustafa, a Data Analyst at Turing, currently working on fine-tuning and evaluating multimodal LLM models. My expertise spans building end-to-end data pipelines, real-time streaming systems, and cloud-scale data lakehouses. I am passionate about scalable data architectures and actively seeking opportunities in data engineering.
Pinned Loading
-
Enterprise_Retail_Data_Lakehouse
Enterprise_Retail_Data_Lakehouse PublicBatch retail data lakehouse on Databricks: Delta Live Tables (bronze β silver β gold), Unity Catalog, synthetic data generator, and an executive analytics dashboard.
Python
-
Realtime-Profile-Ingestion-Pipeline
Realtime-Profile-Ingestion-Pipeline PublicReal-time data pipeline that ingests user profiles from a REST API, streams them through Kafka, processes with Spark Structured Streaming, and persists to Cassandra β orchestrated by Airflow and fuβ¦
Python
-
Sentiment-Driven-Stock-Analysis
Sentiment-Driven-Stock-Analysis PublicThis project predicts stock market performance using sentiment analysis of news headline. The sentiment is visualized using Treemap
-
-
Plant-Disease-Detection-using-Deep-learning
Plant-Disease-Detection-using-Deep-learning PublicJupyter Notebook 1
-
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.
