- Built clustering pipelines for automotive data, partitioning, hierarchical, density based.
- Evaluated quality with internal and external metrics, reported findings.
- Applied stratified sampling for representativeness and stable comparisons.
- Ran descriptive and inferential analysis to validate conclusions.
Portfolio
Data Science, Mathematics, and Statistical Modeling.
As a Data Scientist with a strong foundation in mathematics and statistics, I specialize in developing interpretable models and applying rigorous analytical methods to uncover meaningful insights from complex data. I am passionate about bridging theoretical knowledge with practical applications, delivering impactful solutions to real-world problems.
Work experience
- Graded assignments for Foundations of Machine Learning (Chair of Reliable Machine Learning, Apr 2025 – Aug 2025).
- Graded assignments for Analysis I and Analysis II (Chair of Mathematical Analysis, Sep 2023 – Mar 2025).
- Translated and LaTeX-typeset course scripts for Stochastics and Scientific Computing (Chair of Data Assimilation, Sep 2024).
- Translated and LaTeX-typeset the Analysis 3/Integration Theory course script (Chair of Reliable Machine Learning, Jul 2023).
Projects
Neural network assisted 3DVar on Lorenz 63
In this project, we implemented a neural-network-supported 3DVar data assimilation pipeline on the chaotic Lorenz ’63 system. We generated a synthetic twin experiment and computed innovations and 3DVar increments to establish a strong classical baseline. On top of that, we trained a compact MLP (ReLU, Adam, early stopping) to predict analysis increments directly from the background state and observations, and also evaluated a hybrid scheme that blends the learned increment with the 3DVar update. Evaluation included an 80/20 train–validation split, Monte-Carlo studies across multiple observation-noise levels, and a partial-observations setting. Across settings, the learned model consistently reduced mean L2/RMSE versus standard 3DVar. Overall, the approach demonstrated robust improvements in state estimation over classical 3DVar.
Time-to-sell prediction for cars
As part of a team of four, I worked on a project for Audi focused on predicting the time it takes to sell cars. We developed a deep learning model that accurately predicted the timespan, achieving a margin of error of just 9 days, significantly outperforming the baseline of approximately 20 days. To ensure the predictions were interpretable, we also utilized a Generalized Additive Model (GAM), which allowed us to gain valuable insights into the key factors influencing the sales times. The project placed a strong emphasis on handling high-dimensional data and performing comprehensive feature preprocessing to ensure optimal model performance. Our team presented our findings multiple times at Audi, demonstrating the model’s effectiveness and impact. The project was graded with a top score of 1.0.
Fake News Classification
In my second semester, I worked on a project focused on classifying news as fake or true of this Kaggle Fake and Real News Dataset, utilizing both classical machine learning classification methods and modern deep learning approaches. The project required extensive data preprocessing, feature engineering, and careful model evaluation to ensure accuracy and robustness. By applying a combination of traditional techniques and advanced neural networks, I was able to achieve highly reliable results. The final model achieved an accuracy of approximately 99%, demonstrating strong generalization across both classes. The project was graded with the highest possible score of 1.0.
Portfolio site
I designed and built my personal portfolio website with a lightweight, static stack: HTML, CSS, and JavaScript. The UI includes quality-of-life controls such as a white mode and an option to deactivate the animated particles background for improved readability and reduced motion. Content is organized into clear sections (Work Experience, Projects, Education, Skills, Socials), with a downloadable resume and a simple contact form, all wrapped in a responsive layout that works smoothly across devices. I kept dependencies minimal, optimized images and icons, and used semantic markup to keep the site fast and maintainable. Icons are sourced from RemixIcon, Font Awesome, and Devicon, and the whole site is built and styled by me end-to-end.
Education
Master of Science, Mathematics in Data Science
- Focus: Probability theory
- Topics: advanced math, modeling, Python
Bachelor of Science, Data Science
- Specialization: applied math, scientific computing
- Tools: Python, R, MATLAB, Git
- GPA: 1.2