B.Sc. CSIT · Tribhuvan University · 2026 Graduate
A production-style real-time data pipeline that streams live cryptocurrency prices from the CryptoCompare API into Apache Kafka, processes them with a Python consumer, and persists the output for downstream analysis — all containerised with Docker Compose for a one-command setup.
| Field | Type | Description |
|---|---|---|
| coin | string | BTC or ETH symbol |
| price | float | Current USD price |
| timestamp | float | Unix timestamp of fetch |
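As a rough illustration of the message schema above, the sketch below models one price record and round-trips it through bytes, as a producer and consumer might. JSON encoding, the `PriceTick` class name, and the sample price are assumptions made for this example; the project's actual serialisation may differ.

```python
import json
import time
from dataclasses import asdict, dataclass


@dataclass(frozen=True)
class PriceTick:
    """Hypothetical container mirroring the three schema fields."""
    coin: str         # "BTC" or "ETH"
    price: float      # current USD price
    timestamp: float  # Unix timestamp of fetch

    def to_bytes(self) -> bytes:
        # Assumption: messages are sent as UTF-8 encoded JSON.
        return json.dumps(asdict(self)).encode("utf-8")

    @staticmethod
    def from_bytes(raw: bytes) -> "PriceTick":
        data = json.loads(raw.decode("utf-8"))
        # Coerce types explicitly so malformed messages fail early.
        return PriceTick(
            coin=str(data["coin"]),
            price=float(data["price"]),
            timestamp=float(data["timestamp"]),
        )


# Illustrative values only; not real market data.
tick = PriceTick(coin="BTC", price=64250.5, timestamp=time.time())
payload = tick.to_bytes()
restored = PriceTick.from_bytes(payload)
```

Keeping the encode and decode steps next to the schema definition means a field rename only has to be made in one place.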
The `streaming/` and `batch/` folders reflect a clear separation of processing modes.

A production-ready data warehouse built on Microsoft SQL Server using the Medallion Architecture pattern. Raw CRM and ERP data (CSV) flows through three distinct layers, each with well-defined responsibilities, into business-ready Gold tables optimised for reporting and analytics.
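The three-layer flow can be sketched in a few lines of Python. This is only an in-memory illustration of the Bronze, Silver, Gold idea, not the project's SQL Server implementation: the `warehouse` dict, the sample rows, and the column names are all hypothetical, and the function names are borrowed from the project purely for readability.

```python
from collections import defaultdict

# Hypothetical in-memory stand-in for the three warehouse layers.
warehouse = {"bronze": [], "silver": [], "gold": {}}

# Made-up raw rows, as they might arrive from a CSV file (all strings).
RAW_CSV_ROWS = [
    {"customer_id": " 1 ", "amount": "10.50"},
    {"customer_id": "2", "amount": "4.00"},
    {"customer_id": "1", "amount": "2.25"},
]


def load_bronze(rows):
    # Bronze: store the raw data unchanged. Replacing the whole layer
    # (rather than appending) is what makes a re-run safe.
    warehouse["bronze"] = [dict(row) for row in rows]


def load_silver():
    # Silver: trim whitespace and cast strings to proper types.
    warehouse["silver"] = [
        {"customer_id": int(row["customer_id"].strip()),
         "amount": float(row["amount"])}
        for row in warehouse["bronze"]
    ]


def load_gold():
    # Gold: aggregate into a reporting-friendly shape (total per customer).
    totals = defaultdict(float)
    for row in warehouse["silver"]:
        totals[row["customer_id"]] += row["amount"]
    warehouse["gold"] = dict(totals)


def run_pipeline(rows):
    load_bronze(rows)
    load_silver()
    load_gold()
    return warehouse["gold"]


first = run_pipeline(RAW_CSV_ROWS)
second = run_pipeline(RAW_CSV_ROWS)  # re-running yields the same Gold result
```

Because each step overwrites its target layer instead of appending to it, running the pipeline twice produces identical output, which is one common way to keep loads replayable.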
`load_bronze`, `load_silver`, and `load_gold` encapsulate each layer's load logic, keeping the loads clean and replayable.
The `sourcesystem_entity` naming pattern is enforced across all tables for clear data lineage.

The `tests/` folder demonstrates awareness of data quality validation.
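The two ideas above, a table naming pattern and data quality checks, could look something like the following minimal sketch. The source prefixes `crm` and `erp` come from the systems mentioned earlier; the regular expression, the example table names, the `customer_id` column, and both function names are assumptions for illustration and are not taken from the project's `tests/` folder.

```python
import re

# Assumption: table names start with the source system, then an entity name.
KNOWN_SOURCES = ("crm", "erp")
NAME_PATTERN = re.compile(r"^(?:" + "|".join(KNOWN_SOURCES) + r")_[a-z0-9_]+$")


def follows_naming(table_name: str) -> bool:
    """Return True if the name looks like <sourcesystem>_<entity>."""
    return NAME_PATTERN.match(table_name) is not None


def find_key_issues(rows, key):
    """Return row indexes whose key value is missing or already seen."""
    seen = set()
    issues = {"null_key": [], "duplicate_key": []}
    for index, row in enumerate(rows):
        value = row.get(key)
        if value is None:
            issues["null_key"].append(index)
        elif value in seen:
            issues["duplicate_key"].append(index)
        else:
            seen.add(value)
    return issues


# Hypothetical sample data: one duplicate key and one missing key.
sample_rows = [
    {"customer_id": 1},
    {"customer_id": 2},
    {"customer_id": 2},
    {"customer_id": None},
]
issues = find_key_issues(sample_rows, "customer_id")
name_ok = follows_naming("crm_cust_info")  # hypothetical table name
```

Checks like these are cheap to run after every load, and returning the offending row indexes (instead of a bare pass or fail) makes a failure easier to investigate.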