Building scalable data pipelines, transforming raw data into actionable insights.
Recent Computer Science graduate with a deep focus on data infrastructure and pipeline engineering. I thrive at the intersection of software engineering and data — turning raw, messy data into reliable, scalable systems.
Currently seeking entry-level Data Engineering roles where I can contribute immediately and grow alongside a team obsessed with data quality and reliability.
Architected a production-ready data warehouse using the Medallion Architecture (Bronze → Silver → Gold), ingesting raw CRM and ERP data through a full-load batch ETL pipeline into Microsoft SQL Server. Designed normalized data models across the Bronze (raw), Silver (cleaned/standardized), and Gold (business-ready) layers, implementing stored procedures (load_bronze, load_silver, load_gold) for automated layer loading. Enforced data governance best practices including naming conventions, deduplication, missing-value handling, and schema validation across all pipeline stages.
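A minimal sketch of the Bronze → Silver → Gold flow described above, with SQLite standing in for SQL Server; the table names and sample rows are illustrative, not taken from the actual project:

```python
import sqlite3

# Hypothetical in-memory warehouse standing in for SQL Server.
conn = sqlite3.connect(":memory:")
cur = conn.cursor()

# Bronze: raw CRM rows loaded as-is (duplicates and messy values included).
cur.execute("CREATE TABLE bronze_customers (id INTEGER, name TEXT, country TEXT)")
cur.executemany(
    "INSERT INTO bronze_customers VALUES (?, ?, ?)",
    [(1, "  Alice ", "US"), (1, "  Alice ", "US"), (2, "Bob", None)],
)

# Silver: cleaned and standardized -- trim names, default missing countries,
# and deduplicate on the business key.
cur.execute("""
    CREATE TABLE silver_customers AS
    SELECT id,
           TRIM(name) AS name,
           COALESCE(country, 'n/a') AS country
    FROM bronze_customers
    GROUP BY id
""")

# Gold: business-ready aggregate for reporting.
cur.execute("""
    CREATE TABLE gold_customer_counts AS
    SELECT country, COUNT(*) AS customers
    FROM silver_customers
    GROUP BY country
""")

rows = cur.execute("SELECT * FROM silver_customers ORDER BY id").fetchall()
print(rows)  # deduplicated, trimmed, null-handled rows
```

In the real project each layer would be loaded by its own stored procedure (load_bronze, load_silver, load_gold) rather than inline CREATE TABLE AS statements; the cleaning steps are the same idea.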
Engineered an end-to-end real-time and historical data pipeline for cryptocurrency prices, supporting both streaming (Kafka) and batch analytics workflows. Designed a Kafka producer/consumer architecture with coin-keyed partitioning; ingested 30 days of historical OHLCV data as structured CSVs; containerized the full infrastructure with Docker Compose for reproducible deployment.
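The point of coin-keyed partitioning is that every message for the same coin symbol lands on the same partition, so per-coin ordering is preserved. A toy illustration of that routing (Kafka's default partitioner actually uses murmur2; the MD5 hash and partition count here are stand-ins):

```python
import hashlib

# Illustrative partition count -- not the project's real topic configuration.
NUM_PARTITIONS = 6

def partition_for(coin: str, num_partitions: int = NUM_PARTITIONS) -> int:
    """Route a message key (coin symbol) to a partition deterministically."""
    digest = hashlib.md5(coin.encode("utf-8")).digest()
    return int.from_bytes(digest[:4], "big") % num_partitions

# Every BTC tick routes to the same partition, keeping BTC events in order.
btc_partition = partition_for("BTC")
print({coin: partition_for(coin) for coin in ["BTC", "ETH", "SOL"]})
```

Because the mapping depends only on the key, a consumer reading one partition sees a given coin's ticks in the order they were produced.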
Designed and executed SQL queries on PostgreSQL and SQL Server for data extraction, transformation, and reporting, contributing directly to ETL reporting workflows under developer mentorship. Identified and resolved data inconsistencies in application datasets, enforcing data quality and governance standards to improve the reliability of downstream analytical outputs. Collaborated with the development team through Git-based version control, contributing to structured software delivery and code-review processes.
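Data-quality checks like the ones described above typically boil down to short diagnostic queries. A sketch of two such checks (duplicate keys, missing values) using SQLite and invented sample data:

```python
import sqlite3

# Hypothetical dataset with the kinds of inconsistencies described above:
# duplicate keys and missing values that would skew downstream reports.
conn = sqlite3.connect(":memory:")
cur = conn.cursor()
cur.execute("CREATE TABLE orders (order_id INTEGER, amount REAL)")
cur.executemany(
    "INSERT INTO orders VALUES (?, ?)",
    [(100, 9.99), (100, 9.99), (101, None), (102, 25.0)],
)

# Duplicate-key check: any order_id appearing more than once.
dupes = cur.execute("""
    SELECT order_id, COUNT(*) AS n
    FROM orders
    GROUP BY order_id
    HAVING COUNT(*) > 1
""").fetchall()

# Completeness check: rows with a missing amount.
missing = cur.execute(
    "SELECT COUNT(*) FROM orders WHERE amount IS NULL"
).fetchone()[0]

print(dupes, missing)  # [(100, 2)] 1
```

The same HAVING and IS NULL patterns carry over directly to PostgreSQL and SQL Server.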
Open to Data Engineering roles, internships, and collaborations. Let's talk about data infrastructure.