Available for Full-Time Roles

DATA
ENGINEER
ENTRY LEVEL

Building scalable data pipelines, transforming raw data into actionable insights.

View Projects Portfolio View Projects Get In Touch
INGEST PROCESS STORE KAFKA SPARK S3/GCS APIs/DBs ETL/ELT Warehouse Streaming Transform Lake/DW END-TO-END PIPELINE
MS SQL IBM Db2 Python Apache Kafka Apache Airflow Docker MySQL Medallion Architecture Inmon Kimbaff Big Data MS SQL IBM Db2 Python Apache Kafka Apache Airflow Docker MySQL Medallion Architecture Inmon Kimbaff Big Data

BUILDING THE
DATA LAYER

Recent Computer Science graduate with a deep focus on data infrastructure and pipeline engineering. I thrive at the intersection of software engineering and data — turning raw, messy data into reliable, scalable systems.

Currently seeking entry-level Data Engineering roles where I can contribute immediately and grow alongside a team obsessed with data quality and reliability.

dipesh_profile.py
class DataEngineer:
def __init__(self):
self.name = "Dipesh Luitel"
self.role = "Data Engineer"
self.experience = "Entry Level"
self.education = "Bsc. CSIT"
self.skills = [
"Python", "Spark",
"dbt", "Airflow",
"SQL", "Kafka", "SSIS"
]
self.open_to_work = True # ← hire me!
 
$ python dipesh_profile.py
> Ready to build pipelines. 🚀
$
Architecture I Work With

THE DATA
PIPELINE

🗄️
SOURCE
Raw Data
REST · DB · CSV
📥
INGEST
Collection
Kafka · SSIS · SQL
⚙️
TRANSFORM
Processing
Spark · MS SQL Server
🏛️
WAREHOUSE
Serving Layer
Star Schema
📊
CONSUME
Insights
BI · ML · Dash

WHAT I'VE
BUILT

01 Completed

SQL Data Warehouse Project

Architected a production-ready data warehouse using Medallion Architecture (Bronze → Silver → Gold layers), ingesting raw CRM and ERP data through a full-load ETL batch pipeline into Microsoft SQL Server. Designed normalized data models across Bronze (raw), Silver (cleaned/standardized), and Gold (business-ready), layers — implementing stored procedures (load_bronze, load_silver, load_gold) for automated layer loading. Enforced data governance best practices including naming conventions, deduplication, missing value handling, and schema validation across all pipeline stages.

SQL Server ETL pipelines Medallion Architecture Data Modeling Star Schema Draw.io
02 Completed

Real-Time Crypto Data Platform

Engineered an end-to-end real-time and historical data pipeline for cryptocurrency prices, supporting both streaming (Kafka) and batch analytics workflows — demonstrating core data engineering pipeline design.Designed Kafka producer/consumer architecture with coin-keyed partitioning; ingested 30 days of historical OHLCV data as structured CSVs; containerized the full infrastructure using Docker Compose for reproducible deployment.

Kafka Python Docker Pandas PostgreSQL Zookeeper
Background

EXPERIENCE &
EDUCATION

Jan 2026 — Mar 2026
SQL Developer Intern
Pratham IT System Pvt. Ltd

Designed and executed SQL queries on PostgreSQL and SQL Server for data extraction, transformation, and reporting — contributing directly to ETL reporting workflows under developer mentorship.Identified and resolved data inconsistencies in application datasets, enforcing data quality and governance standards to improve reliability of downstream analytical outputs. Collaborated with the development team through Git-based version control, contributing to structured software delivery and code review processes.

SQL Server ETL PostgreSQL Git Python
2022 — 2026
B.Sc. Computer Science and Information Technology
Tribhuwan University
Let's Connect

LET'S BUILD
PIPELINES
TOGETHER

Open to Data Engineering roles, internships, and collaborations. Let's talk about data infrastructure.