Thusharkanth/Data_Engineering_CW

Data Engineering Coursework

πŸ“ Assessment Overview

Module: CM2606 Data Engineering
Deadline: 18th April 2025
Submission:

  • Report
  • Video demonstration (for Question 1)

Key Skills Developed:
✅ Building ETL pipelines with Airflow
✅ Data modeling (Star Schema)
✅ Cloud security & compliance (HIPAA/GDPR)


📂 Repository Contents

Data_Engineering_Coursework/
├── CM2606-CW-2025.pdf            # Original coursework brief
├── Data Engineering CW Report.pdf # Submitted report (contains all answers)
└── README.md                     # This overview file

🎥 Question 1: ETL Pipeline Demo

[Video thumbnail: links to a demonstration of the AWS Airflow implementation]

Components Implemented:

  • OpenWeather API extraction
  • Data transformation to Parquet/CSV
  • S3 loading with Airflow DAGs
  • Automated scheduling
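
The transformation step listed above can be sketched in plain Python. This is a minimal illustration under stated assumptions, not the submitted pipeline: the function names `flatten_weather` and `to_csv`, the chosen output columns, and the sample payload are all hypothetical. The input only mimics the nested shape of an OpenWeather current-weather response, where temperatures are in Kelvin by default.

```python
import csv
import io
from datetime import datetime, timezone


def flatten_weather(payload: dict) -> dict:
    """Flatten a nested OpenWeather-style response into one flat record."""
    main = payload.get("main", {})
    # "weather" is a list of condition objects; take the first if present.
    weather = (payload.get("weather") or [{}])[0]
    return {
        "city": payload.get("name"),
        # "dt" is a Unix timestamp in seconds (UTC).
        "observed_at": datetime.fromtimestamp(
            payload["dt"], tz=timezone.utc
        ).isoformat(),
        # Convert Kelvin to Celsius.
        "temp_c": round(main["temp"] - 273.15, 2),
        "humidity_pct": main.get("humidity"),
        "description": weather.get("description"),
    }


def to_csv(records: list[dict]) -> str:
    """Serialise flat records to CSV text, header first."""
    buf = io.StringIO()
    writer = csv.DictWriter(buf, fieldnames=list(records[0].keys()))
    writer.writeheader()
    writer.writerows(records)
    return buf.getvalue()


# Hypothetical sample payload (assumption, not real API output).
sample = {
    "name": "Colombo",
    "dt": 1713398400,
    "main": {"temp": 301.15, "humidity": 78},
    "weather": [{"description": "scattered clouds"}],
}

if __name__ == "__main__":
    print(to_csv([flatten_weather(sample)]))
```

In an Airflow DAG, a function like `flatten_weather` would typically run inside the transform task, with the resulting CSV (or a Parquet equivalent) handed to the S3 load task.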

📊 Question 2: SuperMart Data Model

Star Schema Components:

  1. Dimension Tables
  2. Fact Table
  3. Aggregate Tables

*Refer to the report for the diagram and other details.*
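
To make the three component types concrete, here is a small sketch using Python's built-in `sqlite3` module. The table and column names (`dim_product`, `dim_date`, `fact_sales`, `agg_monthly_category_sales`) and the sample rows are assumptions for illustration only; the actual SuperMart schema is the one in the report.

```python
import sqlite3


def build_demo_warehouse() -> sqlite3.Connection:
    """Create a tiny in-memory star schema with hypothetical tables."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(
        """
        -- Dimension tables: descriptive attributes
        CREATE TABLE dim_product (
            product_key INTEGER PRIMARY KEY,
            product_name TEXT,
            category TEXT
        );
        CREATE TABLE dim_date (
            date_key INTEGER PRIMARY KEY,
            full_date TEXT,
            month INTEGER,
            year INTEGER
        );
        -- Fact table: measures plus foreign keys to the dimensions
        CREATE TABLE fact_sales (
            sale_id INTEGER PRIMARY KEY,
            product_key INTEGER REFERENCES dim_product(product_key),
            date_key INTEGER REFERENCES dim_date(date_key),
            quantity INTEGER,
            amount REAL
        );
        """
    )
    conn.executemany(
        "INSERT INTO dim_product VALUES (?, ?, ?)",
        [(1, "Rice", "Grocery"), (2, "Soap", "Household")],
    )
    conn.executemany(
        "INSERT INTO dim_date VALUES (?, ?, ?, ?)",
        [(20250101, "2025-01-01", 1, 2025), (20250201, "2025-02-01", 2, 2025)],
    )
    conn.executemany(
        "INSERT INTO fact_sales VALUES (?, ?, ?, ?, ?)",
        [(1, 1, 20250101, 2, 10.0), (2, 1, 20250101, 1, 5.0), (3, 2, 20250201, 3, 9.0)],
    )
    # Aggregate table: pre-computed monthly totals per category
    conn.execute(
        """
        CREATE TABLE agg_monthly_category_sales AS
        SELECT d.year, d.month, p.category, SUM(f.amount) AS total_amount
        FROM fact_sales f
        JOIN dim_product p ON p.product_key = f.product_key
        JOIN dim_date d ON d.date_key = f.date_key
        GROUP BY d.year, d.month, p.category
        """
    )
    return conn


def monthly_totals(conn: sqlite3.Connection) -> list:
    """Read the aggregate table in a stable order."""
    return conn.execute(
        "SELECT year, month, category, total_amount "
        "FROM agg_monthly_category_sales ORDER BY year, month, category"
    ).fetchall()
```

The aggregate table trades storage for query speed: reports that only need monthly category totals can read it directly instead of joining the fact table to its dimensions each time.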


🔒 Question 3: Healthcare Security Framework

1. Authentication & Authorization

  • Implement strong authentication (MFA/OAuth)
  • Design granular authorization using:
    • Role-Based Access Control (RBAC) or
    • Attribute-Based Access Control (ABAC)
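
A role-based check of the kind described above might look like the following sketch. The role names, permission strings, and the `is_allowed` helper are hypothetical; a real deployment would normally rely on the identity provider or cloud IAM service rather than an in-process table.

```python
# Hypothetical role-to-permission mapping (assumption for illustration).
ROLE_PERMISSIONS = {
    "doctor": {"read_record", "write_record"},
    "nurse": {"read_record"},
    "billing": {"read_invoice"},
}


def is_allowed(roles: list[str], permission: str) -> bool:
    """Return True if any of the user's roles grants the permission."""
    return any(permission in ROLE_PERMISSIONS.get(role, set()) for role in roles)
```

Unknown roles map to an empty permission set, so access is denied by default. ABAC would extend this by also evaluating attributes of the user, the resource, and the request context.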

2. Data Protection

  • At-rest encryption: AES-256 standard
  • In-transit encryption: TLS 1.3
  • Sensitive data handling:
    • Column-level security for PII
    • Dynamic data masking
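
Dynamic data masking can be illustrated with a short sketch that masks PII columns at read time depending on the viewer. The helper names `mask_value` and `mask_record`, the `viewer_can_see_pii` flag, and the choice to keep the last four characters visible are assumptions for illustration; managed databases usually provide masking as a built-in policy feature.

```python
def mask_value(value: str, visible: int = 4) -> str:
    """Replace all but the last `visible` characters with asterisks."""
    if len(value) <= visible:
        # Too short to reveal anything safely: mask everything.
        return "*" * len(value)
    return "*" * (len(value) - visible) + value[-visible:]


def mask_record(record: dict, pii_columns: set, viewer_can_see_pii: bool) -> dict:
    """Return a copy of the record with PII columns masked when required."""
    if viewer_can_see_pii:
        return dict(record)
    return {
        key: mask_value(str(val)) if key in pii_columns else val
        for key, val in record.items()
    }
```

The stored data is never modified; masking is applied only to the copy returned to the caller, which is what distinguishes dynamic masking from static redaction.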

3. Compliance Measures

  • Implement audit trails (e.g., AWS CloudTrail)
  • Establish data retention policies
  • Real-time monitoring (SIEM solutions)
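
As a rough illustration of what makes an audit trail trustworthy, the sketch below chains each log entry to the previous one with a SHA-256 hash so that later edits become detectable. This is a generic hash-chain technique, not how AWS CloudTrail is configured or used; the function names and entry fields are hypothetical.

```python
import hashlib
import json

GENESIS_HASH = "0" * 64  # placeholder hash for the first entry
_FIELDS = ("actor", "action", "resource", "prev_hash")


def _digest(body: dict) -> str:
    """Hash the entry body deterministically (sorted keys)."""
    return hashlib.sha256(json.dumps(body, sort_keys=True).encode()).hexdigest()


def append_event(log: list, actor: str, action: str, resource: str) -> dict:
    """Append an entry whose hash covers its content and the previous hash."""
    prev_hash = log[-1]["hash"] if log else GENESIS_HASH
    body = {"actor": actor, "action": action, "resource": resource, "prev_hash": prev_hash}
    entry = {**body, "hash": _digest(body)}
    log.append(entry)
    return entry


def verify_log(log: list) -> bool:
    """Recompute every hash and check that the chain is unbroken."""
    prev_hash = GENESIS_HASH
    for entry in log:
        body = {field: entry[field] for field in _FIELDS}
        if entry["prev_hash"] != prev_hash or _digest(body) != entry["hash"]:
            return False
        prev_hash = entry["hash"]
    return True
```

Changing any field of an earlier entry invalidates its hash and therefore every later link, which is the property an auditor relies on.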

4. Third-Party Access

  • Secure API gateway implementation
  • Token-based authentication
  • Rate limiting mechanisms
  • Aggregated data exposure only
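
Rate limiting for third-party callers is often implemented with a token bucket; the sketch below shows the idea. The class name, parameters, and the explicit `now` argument (used instead of a wall clock so the behaviour is easy to follow) are assumptions, and an API gateway would normally provide this as configuration rather than application code.

```python
class TokenBucket:
    """Allow bursts up to `capacity`, refilled at `refill_per_sec` tokens/second."""

    def __init__(self, capacity: int, refill_per_sec: float, now: float = 0.0):
        self.capacity = capacity
        self.refill_per_sec = refill_per_sec
        self.tokens = float(capacity)  # start with a full bucket
        self.last = now

    def allow(self, now: float) -> bool:
        """Return True and consume one token if the request may proceed."""
        elapsed = max(0.0, now - self.last)
        # Refill in proportion to elapsed time, never above capacity.
        self.tokens = min(self.capacity, self.tokens + elapsed * self.refill_per_sec)
        self.last = now
        if self.tokens >= 1:
            self.tokens -= 1
            return True
        return False
```

One bucket per API token keeps a single noisy client from exhausting capacity for the others.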

Submitted by: Loganthan Thusharkanth (Student ID: 20233168)
