debashisdash1999/README.md

🙏🏻 Greetings, I’m Choudhury Debashis Dash

💻 Data Engineer | Snowflake | ELT / ETL | AI-Driven Data Solutions

Snowflake Certified - Data Engineering Professional Certificate

📍 India | 🌎 Open to roles across India (On-site & Remote) and abroad (Remote) | ⚡ Immediately Available


🚀 What I Do

I design and build data pipelines, cloud warehouses, and AI-driven analytics solutions that help businesses move from raw data to reliable decisions.

Over three years, I've worked at both ends of the data stack: engineering Snowflake pipelines for a client project (Mercedes-Benz, USA & Canada) at Infosys, and owning the full BI function at Troy Consultancy, where I built dashboards used for daily decision-making.

My focus areas:

  • ELT pipeline development in Snowflake - from raw ingestion through Bronze → Silver → Gold layers
  • Automation using Snowpipe, Streams, and Tasks for CDC and scheduled workflows
  • AI-driven data engineering using Snowflake Cortex (Code & Analyst) for assisted SQL development and natural language querying
  • Semantic layer development to enable business-friendly data access
  • BI & reporting with Power BI, connecting to SQL Server, Excel, and web sources
  • Query & warehouse optimization - focused on performance and cost efficiency

🔧 Tech Stack

🤖 AI & Modern Data

Snowflake Cortex | Semantic Layer | NLQ

🗄️ Data Warehousing & Databases

Snowflake | Snowflake SQL | MS SQL Server | PostgreSQL | MySQL

☁️ Cloud & Storage

Azure Data Factory | Azure Data Lake | AWS S3

📊 BI & Visualization

Power BI

💻 Programming & Data Processing

Python | Pandas

🤝 Collaboration

GitHub


📂 Featured Projects (see the Repositories tab on my profile for all projects)


🏗️ End-to-End Data Engineering


Built an end-to-end Snowflake data platform integrating AI-assisted development and semantic analytics. Implemented Medallion Architecture (Bronze → Silver → Gold) to process 13+ source files into an analytical Star Schema with SCD Type 2. Leveraged Snowflake Cortex Code for AI-assisted SQL generation, query optimization, and pipeline development, reducing manual effort. Built a semantic model using Cortex Analyst to enable natural language-to-SQL querying on 86K+ sales records, allowing business users to interact with data without writing SQL.

Snowflake Cortex | Semantic Layer | SCD2 | Bronze | Silver | Gold


Full enterprise-style data warehouse built using Microsoft SQL Server, implementing a Bronze → Silver → Gold layered architecture with CRM and ERP source integration, stored procedure-based ETL, star schema modeling, and a Sales Data Mart with dim_customers, dim_products, and fact_sales.

MS SQL Server | ETL | Star Schema | Bronze | Silver | Gold


Production-style Snowflake pipeline modeled on a food delivery platform, covering initial & delta loads, CDC using Streams, SCD Type 2 dimensions, a star schema fact table at order-item granularity, data governance with Tags & Masking Policies, and full automation via Stored Procedures and Tasks.

Snowflake | Streams | Tasks | SCD2 | Star Schema | Data Governance
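For readers new to SCD Type 2, the idea behind those dimensions can be sketched in plain Python. This is an illustration only, not the Snowflake SQL used in the project: the function `apply_scd2`, the column names (`customer_id`, `city`, `valid_from`, `valid_to`, `is_current`) and the sample rows are all hypothetical.

```python
from datetime import date


def apply_scd2(dimension, incoming, key, tracked, load_date):
    """Return a new dimension list with SCD Type 2 history applied.

    dimension: rows carrying valid_from, valid_to and is_current (hypothetical layout)
    incoming:  rows holding the latest source values for the tracked columns
    """
    result = [dict(row) for row in dimension]  # copy so the input is not mutated
    current = {row[key]: row for row in result if row["is_current"]}

    for new in incoming:
        old = current.get(new[key])
        if old is not None and all(old[col] == new[col] for col in tracked):
            continue  # tracked attributes unchanged: keep the current version
        if old is not None:
            # Expire the previous version instead of overwriting it
            old["valid_to"] = load_date
            old["is_current"] = False
        version = {key: new[key]}
        version.update({col: new[col] for col in tracked})
        version.update({"valid_from": load_date, "valid_to": None, "is_current": True})
        result.append(version)
        current[new[key]] = version
    return result


# Hypothetical sample data: customer 1 moves city, customer 2 is new
dim = [{"customer_id": 1, "city": "Pune", "valid_from": date(2024, 1, 1),
        "valid_to": None, "is_current": True}]
updates = [{"customer_id": 1, "city": "Mumbai"}, {"customer_id": 2, "city": "Delhi"}]
dim_after = apply_scd2(dim, updates, "customer_id", ["city"], date(2024, 6, 1))
```

The old row for customer 1 is kept with a closed validity window, so facts loaded before the change can still join to the city that was valid at the time.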


Enterprise-scale retail analytics solution for an e-commerce company with 5M+ customers across 15 countries. Built on Snowflake with ADLS as the external stage, ingesting CSV, JSON, and Parquet data. Implements Bronze → Silver → Gold layers, CDC with Streams, data quality pipelines, automated Tasks, and Gold layer views for sales performance, customer segmentation, and product analytics.

Snowflake | Azure Data Lake | Streams | Tasks | Parquet | Bronze | Silver | Gold


❄️ Snowflake Concept Projects


Change Data Capture implementation (INSERT / UPDATE / DELETE) using Snowflake Streams with AWS S3 integration.

Snowflake | AWS S3 | CDC
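The way a set of change records is applied to a target table can be sketched in plain Python. A standard Snowflake stream reports an update as a DELETE and INSERT pair, and the sketch mimics that shape. It is an assumption-laden illustration, not the repository's code: `apply_changes`, the `action` and `row` fields, and the sample rows are hypothetical.

```python
def apply_changes(target, changes, key="id"):
    """Apply INSERT / DELETE change records to a copy of target (a dict keyed by id)."""
    merged = dict(target)
    # Apply deletes first so the DELETE half of an update pair
    # cannot remove the freshly inserted version of the same key.
    for change in changes:
        if change["action"] == "DELETE":
            merged.pop(change["row"][key], None)
    for change in changes:
        if change["action"] == "INSERT":
            merged[change["row"][key]] = change["row"]
    return merged


# Hypothetical target table and one batch of changes
target = {1: {"id": 1, "name": "Asha"}, 2: {"id": 2, "name": "Ravi"}}
changes = [
    {"action": "INSERT", "row": {"id": 3, "name": "Meera"}},   # new row
    {"action": "DELETE", "row": {"id": 1, "name": "Asha"}},    # update: old image
    {"action": "INSERT", "row": {"id": 1, "name": "Asha K"}},  # update: new image
    {"action": "DELETE", "row": {"id": 2, "name": "Ravi"}},    # plain delete
]
merged = apply_changes(target, changes)
```

In Snowflake itself this step would typically be a single MERGE statement reading from the stream. The Python version only shows the ordering concern.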


End-to-end automated data ingestion pipeline using Snowpipe — setup, configuration, and event-based triggering.

Snowflake | Snowpipe


Querying and extracting nested JSON data in Snowflake using the VARIANT data type and the FLATTEN function.

Snowflake | JSON | VARIANT
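What flattening does to a nested document can be shown with a small Python analogue: one output row per array element, each carrying the parent fields plus the element's index and value (FLATTEN exposes these as its INDEX and VALUE columns). This is a sketch in Python rather than Snowflake SQL, and the helper `flatten_items`, the JSON document, and the field names are all hypothetical.

```python
import json

# Hypothetical nested order document, as it might be stored in a VARIANT column
raw = """
{"order_id": 101,
 "customer": {"name": "Asha"},
 "items": [{"sku": "A1", "qty": 2}, {"sku": "B7", "qty": 1}]}
"""


def flatten_items(document, array_field):
    """Return one row per element of document[array_field], keeping the parent fields."""
    parent = {k: v for k, v in document.items() if k != array_field}
    return [
        {**parent, "index": i, "value": element}
        for i, element in enumerate(document.get(array_field, []))
    ]


# Project the flattened rows into a flat, tabular shape
rows = [
    {
        "order_id": r["order_id"],
        "customer_name": r["customer"]["name"],
        "sku": r["value"]["sku"],
        "qty": r["value"]["qty"],
    }
    for r in flatten_items(json.loads(raw), "items")
]
```

Each order line becomes its own row, which is the shape needed before loading into a relational table.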


🧹 SQL Projects


Real-world SQL data cleaning — handling nulls, duplicates, standardization, and data type corrections.

MS SQL Server | Data Cleaning


Advanced SQL analytics on MLB player, team, and school data — window functions, aggregations, and performance insights.

SQL | Analytics


Analyzing restaurant menu and order data to surface popular dishes, pricing trends, and customer spending patterns.

SQL | Analytics


🐍 Python — Data Cleaning Projects


Real-world Airbnb dataset cleaned using Pandas — handling missing values, outliers, type conversions, and column normalization.

Python | Pandas | Data Cleaning


Amazon product data cleaned and preprocessed using Pandas — structured for downstream analytics or ML use.

Python | Pandas | Data Cleaning


📊 Power BI Reports


Interactive HR dashboard covering employee headcount, attrition analysis, departmental performance, and workforce KPIs.

Power BI | HR Analytics | Interactive


Visual analysis of personality survey data with dynamic slicers, trait distributions, and behavioral pattern breakdowns.

Power BI | Survey Analytics | Interactive


📫 Contact Me


Always learning, always building — data tells the story, I make it clear.

Popular repositories

  1. snowflake_pacific_retail_end_to_end_project Public

    End-to-end Snowflake Data Engineering project demonstrating real-time ingestion, CDC with Streams, automated Tasks, data quality pipelines, and Gold layer analytics for enterprise-scale retail anal…

    PLpgSQL 1

  2. snowflake_swiggy_end_to_end_data_pipeline_project Public

    Second data engineering project: an end-to-end data pipeline.

    PLpgSQL 1

  3. debashisdash1999 Public

    Greetings and welcome to my profile.

    1

  4. end_to_end_sql_data_warehouse_prjct Public

    End-to-end data engineering project using Microsoft SQL Server. Implements a layered data warehouse (Bronze, Silver, Gold) with SQL-based ETL, data cleaning, and star schema modeling from CRM and E…

    TSQL 1

  5. Snowflake-End-to-End-Data-Pipeline-with-Cortex-AI-Dynamic-Tables-Star-Schema-Modeling Public

    Production-style Snowflake data pipeline implementing Medallion Architecture (Bronze–Silver–Gold), incremental processing using Dynamic Tables, SCD Type 2 dimensions, and a semantic layer enabling …

    1

  6. sql_retail_sale_project Public

    Retail Sales Analysis Project: This repository hosts a PostgreSQL project analyzing retail sales data with a retail_sales table. It features SQL queries for data cleaning, exploration, and insights …