Retrieval-Augmented Employee Analytics Using LangChain and Gemini
In this project, I built a GenAI-powered assistant to explore synthetic HR data through interactive dashboards and natural language queries.
Welcome to my data science playground! I’m a curious, creative, and coffee-fueled data enthusiast who loves uncovering patterns, building smart solutions, and turning messy data into meaningful insights. Whether it’s segmenting customers, forecasting logistics, or detecting fast radio bursts from deep space, I enjoy using machine learning and statistics to solve real-world problems that matter.
This portfolio is a collection of projects that reflect both my technical skills and my love for data storytelling. From predictive models and dashboards to end-to-end pipelines and ML experiments — every project here has taught me something new and pushed me to think smarter.
When I’m not deep in a dataset, you’ll probably find me hosting tech meetups, geeking out over new Python tools, or daydreaming about building a model that predicts when I’ll finally get 8 hours of sleep.
Thanks for stopping by — I hope you find something here that sparks your curiosity too!
In this project, I built a GenAI-powered assistant to explore synthetic HR data through interactive dashboards and natural language queries.
A machine learning project to predict loan defaults using real-world LendingClub data and risk modeling techniques.
In this project, I performed customer segmentation on retail transaction data to identify key customer groups for targeted marketing.
In this project, I trained a BERT model using PyTorch to tokenize input sentences and accurately identify named entities, exploring how transformer-based models perform sequence labeling tasks.
In this project, Galaxy zoo images were used to train and test a network to classify elliptical and spiral galaxies.>
In this project, featues of a Kickstarter project such as the category, country, currency, the pleded amount (in USD and in the given currency), the deadline for the completion of the project is used to predict the state of the project (whether it is successful or not).
A collection of 30,000 audio files was sourced from the Marinexplore and Cornell University Whale Detection Challenge to classify clips containing whale sounds from those without any.