Hi,
I am Ronit Roy
Data Engineer
Download CV
✨ About Me
I'm Ronit Roy, a passionate data-driven problem solver who transforms information into real-world impact.
Currently working as a Data Engineer at Saayam For All, I specialize in building scalable pipelines and real-time analytics systems that drive meaningful outcomes.
I recently earned my Master’s in Applied Data Analytics from Boston University, mastering the craft of robust data architectures, cloud-native workflows, and machine learning pipelines.
With a strong command of Python, SQL, Airflow, Spark, and AWS, I love solving complex problems, collaborating with teams, and creating systems that make data truly valuable.
💼 Experience
Data Engineer
Saayam For All Mar 2025 – PresentResearch Assistant
GLOB S Research Lab Oct 2023 – May 2024- Built Airflow + PostgreSQL pipelines improving processing by 25%.
- Developed NER pipelines (95% accuracy) for healthcare docs using Docker + Python.
- Optimized SQL queries (30% faster) and created dashboard insights.
Machine Learning Intern
HighRadius Jan 2022 – Apr 2022- Engineered CNNs for fraud detection, boosting accuracy by 30%.
- Created ETL pipelines in Python, integrated with Snowflake.
- Deployed Flask APIs via Docker & Jenkins CI/CD pipelines.
🛠️ Projects
Spotify Data Pipeline
Python, Snowflake, AWS, Airflow
- End-to-end ETL for 1M+ records/day, optimized with Airflow DAGs.
- Enabled real-time data flow using AWS Lambda & Snowpipe.
Customer Data Lake
Python, Spark, AWS S3
- Handled 10+ TB historical retail data using Spark, partitioning & compression.
E-commerce Recommendation System
TensorFlow, Flask, PostgreSQL
- Collaborative filtering model boosted engagement by 20%.
- Served real-time results via Flask API.
Advanced ETL for Retail Analytics
Kafka, Airflow, Snowflake, Redshift
- Stream-processed 500K+ records/hour with automated failure recovery.
- Real-time insights via Redshift + optimized data warehousing.
💡Skills
Professional Skills
🎓 Education
Boston University
M.S. in Applied Data Analytics Sept 2023 – Dec 2024Relevant Coursework: Advanced Machine Learning, Database Management, Data Mining
SRM Institute of Science and Technology
B.Tech in Computer Science Jul 2019 – Apr 2023Relevant Coursework: DSA, Probability & Stats, Data Visualization