Hi,
I am Ronit Roy
Data Engineer

Download CV

✨ About Me

I'm Ronit Roy, a passionate data-driven problem solver who transforms information into real-world impact.

Currently working as a Data Engineer at Saayam For All, I specialize in building scalable pipelines and real-time analytics systems that drive meaningful outcomes.

I recently earned my Master’s in Applied Data Analytics from Boston University, mastering the craft of robust data architectures, cloud-native workflows, and machine learning pipelines.

With a strong command of Python, SQL, Airflow, Spark, and AWS, I love solving complex problems, collaborating with teams, and creating systems that make data truly valuable.

💼 Experience

Data Engineer

Saayam For All Mar 2025 – Present

Research Assistant

GLOB S Research Lab Oct 2023 – May 2024
  • Built Airflow + PostgreSQL pipelines improving processing by 25%.
  • Developed NER pipelines (95% accuracy) for healthcare docs using Docker + Python.
  • Optimized SQL queries (30% faster) and created dashboard insights.

Machine Learning Intern

HighRadius Jan 2022 – Apr 2022
  • Engineered CNNs for fraud detection, boosting accuracy by 30%.
  • Created ETL pipelines in Python, integrated with Snowflake.
  • Deployed Flask APIs via Docker & Jenkins CI/CD pipelines.

🛠️ Projects

Spotify Data Pipeline

Python, Snowflake, AWS, Airflow

  • End-to-end ETL for 1M+ records/day, optimized with Airflow DAGs.
  • Enabled real-time data flow using AWS Lambda & Snowpipe.

Customer Data Lake

Python, Spark, AWS S3

  • Handled 10+ TB historical retail data using Spark, partitioning & compression.

E-commerce Recommendation System

TensorFlow, Flask, PostgreSQL

  • Collaborative filtering model boosted engagement by 20%.
  • Served real-time results via Flask API.

Advanced ETL for Retail Analytics

Kafka, Airflow, Snowflake, Redshift

  • Stream-processed 500K+ records/hour with automated failure recovery.
  • Real-time insights via Redshift + optimized data warehousing.

💡Skills

Professional Skills

SQl
95%
Python
90%
Data Analysis
90%
ETL
90%

🎓 Education

Boston University

M.S. in Applied Data Analytics Sept 2023 – Dec 2024

Relevant Coursework: Advanced Machine Learning, Database Management, Data Mining

SRM Institute of Science and Technology

B.Tech in Computer Science Jul 2019 – Apr 2023

Relevant Coursework: DSA, Probability & Stats, Data Visualization

📨 Contact