I have a background in financial software implementation and support, where I gained hands-on experience working with business processes, databases, and real-world data.
My interests lie in data engineering, analytics, and automation. I enjoy building data pipelines, integrating data from different sources, transforming raw data into meaningful insights, and creating solutions that support data-driven decision-making.
Currently, I am expanding my skills through practical projects involving Apache Kafka, Apache Airflow, Apache Spark, PostgreSQL, MongoDB, ClickHouse, Grafana, and Power BI, while continuing to strengthen my expertise in Python and SQL.
- Python
- SQL
- Scala (Apache Spark Fundamentals)
- Linux
- Apache Kafka
- Apache Airflow
- Apache Spark
- ETL Development
- Data Cleaning & Transformation
- PostgreSQL
- SQL Server
- MongoDB
- ClickHouse
- Power BI
- Grafana
- Git & GitHub
- Docker
- REST APIs
- Web Scraping
Real-time spam classification pipeline using Kafka, Machine Learning, and PostgreSQL.
Tech: Kafka, Python, PostgreSQL, scikit-learn
End-to-end ETL workflow for extracting, transforming, and loading SMS datasets.
Tech: Python, PostgreSQL, ETL
Automated data ingestion pipeline that extracts data from the Divar API and stores it in PostgreSQL.
Tech: Python, REST API, PostgreSQL
Workflow orchestration and scheduling using Apache Airflow.
Tech: Apache Airflow, Python, PostgreSQL
Data preprocessing, cleaning, and transformation workflow for analytics-ready datasets.
Tech: Python, Pandas
Large-scale dataset processing, cleaning, and transformation using Apache Spark.
Tech: Spark, Python, Scala, Parquet
Analytical processing and reporting using ClickHouse, PostgreSQL, and Grafana.
Tech: ClickHouse, PostgreSQL, Grafana
Interactive dashboards and KPI monitoring for business analytics.
Tech: Grafana, PostgreSQL
Interactive Power BI dashboard for analyzing student performance metrics.
Tech: Power BI, DAX, Data Modeling
Web scraping project for collecting and storing perfume product data.
Tech: Python, Requests, BeautifulSoup, SQLite
- Apache Spark
- ClickHouse
- Data Warehousing
- Data Modeling
- Distributed Data Processing
- Real-Time Data Pipelines
- Data Engineering
- ETL & ELT Pipelines
- Data Platforms
- Analytics Engineering
- Big Data Processing
- Workflow Automation
LinkedIn: https://linkedin.com/in/samira-siavash
GitHub: https://github.com/SamiraSiavash
Email: s.siavash.m@gmail.com
Let's connect and talk about data, BI, and engineering solutions!