Portfolio

Data Science, Mathematics, and Statistical Modeling.

As a Data Scientist with a strong foundation in mathematics and statistics, I specialize in developing interpretable models and applying rigorous analytical methods to uncover meaningful insights from complex data. I am passionate about bridging theoretical knowledge with practical applications, delivering impactful solutions to real-world problems.

4
Core areas
6+
Projects shipped
Speed
Fast adaptation
Machine Learning Statistics Deep Learning Model Interpretability
Portrait placeholder
Project visual placeholder
Chart placeholder

Work experience

Data Science Intern
02/2025 to 04/2025
BMW AG, Munich
  • Built clustering pipelines for automotive data, partitioning, hierarchical, density based.
  • Evaluated quality with internal and external metrics, reported findings.
  • Applied stratified sampling for representativeness and stable comparisons.
  • Ran descriptive and inferential analysis to validate conclusions.
Python Clustering Statistics Reporting
Student Assistant
10/2023 to 04/2025
Catholic University of Eichstätt-Ingolstadt, Ingolstadt
  • Graded assignments for Foundations of Machine Learning (Chair of Reliable Machine Learning, Apr 2025 – Aug 2025).
  • Graded assignments for Analysis I and Analysis II (Chair of Mathematical Analysis, Sep 2023 – Mar 2025).
  • Translated and LaTeX-typeset course scripts for Stochastics and Scientific Computing (Chair of Data Assimilation, Sep 2024).
  • Translated and LaTeX-typeset the Analysis 3/Integration Theory course script (Chair of Reliable Machine Learning, Jul 2023).
LaTeX Grading

Projects

Project image placeholder
Machine learning

Neural network assisted 3DVar on Lorenz 63

In this project, we implemented a neural-network-supported 3DVar data assimilation pipeline on the chaotic Lorenz ’63 system. We generated a synthetic twin experiment and computed innovations and 3DVar increments to establish a strong classical baseline. On top of that, we trained a compact MLP (ReLU, Adam, early stopping) to predict analysis increments directly from the background state and observations, and also evaluated a hybrid scheme that blends the learned increment with the 3DVar update. Evaluation included an 80/20 train–validation split, Monte-Carlo studies across multiple observation-noise levels, and a partial-observations setting. Across settings, the learned model consistently reduced mean L2/RMSE versus standard 3DVar. Overall, the approach demonstrated robust improvements in state estimation over classical 3DVar.

PyTorch Data assimilation Evaluation
Project image placeholder
Applied ML

Time-to-sell prediction for cars

As part of a team of four, I worked on a project for Audi focused on predicting the time it takes to sell cars. We developed a deep learning model that accurately predicted the timespan, achieving a margin of error of just 9 days, significantly outperforming the baseline of approximately 20 days. To ensure the predictions were interpretable, we also utilized a Generalized Additive Model (GAM), which allowed us to gain valuable insights into the key factors influencing the sales times. The project placed a strong emphasis on handling high-dimensional data and performing comprehensive feature preprocessing to ensure optimal model performance. Our team presented our findings multiple times at Audi, demonstrating the model’s effectiveness and impact. The project was graded with a top score of 1.0.

Deep learning GAM Feature engineering
Project image placeholder
NLP

Fake News Classification

In my second semester, I worked on a project focused on classifying news as fake or true of this Kaggle Fake and Real News Dataset, utilizing both classical machine learning classification methods and modern deep learning approaches. The project required extensive data preprocessing, feature engineering, and careful model evaluation to ensure accuracy and robustness. By applying a combination of traditional techniques and advanced neural networks, I was able to achieve highly reliable results. The final model achieved an accuracy of approximately 99%, demonstrating strong generalization across both classes. The project was graded with the highest possible score of 1.0.

NLP Transformers Validation
Project image placeholder
Engineering

Portfolio site

I designed and built my personal portfolio website with a lightweight, static stack: HTML, CSS, and JavaScript. The UI includes quality-of-life controls such as a white mode and an option to deactivate the animated particles background for improved readability and reduced motion. Content is organized into clear sections (Work Experience, Projects, Education, Skills, Socials), with a downloadable resume and a simple contact form, all wrapped in a responsive layout that works smoothly across devices. I kept dependencies minimal, optimized images and icons, and used semantic markup to keep the site fast and maintainable. Icons are sourced from RemixIcon, Font Awesome, and Devicon, and the whole site is built and styled by me end-to-end.

HTML CSS JS

Education

Master of Science, Mathematics in Data Science

Technical University of Munich, Munich

10/2025 to present

  • Focus: Probability theory
  • Topics: advanced math, modeling, Python

Bachelor of Science, Data Science

Catholic University Eichstätt-Ingolstadt, Eichstätt

10/2022 to 09/2025

  • Specialization: applied math, scientific computing
  • Tools: Python, R, MATLAB, Git
  • GPA: 1.2

Skills

Core
Python MATLAB R NumPy pandas Matplotlib Scikit-Learn
ML
PyTorch TensorFlow Keras XGBoost Interpretability
Engineering
Git GitHub Linux Docker pytest SQL MySQL
Web
HTML CSS
Cloud
AWS
Writing
LaTeX
Python
Python
MATLAB
MATLAB
R
R
NumPy
NumPy
pandas
pandas
Matplotlib
Matplotlib
Scikit-Learn
Scikit-Learn
PyTorch
PyTorch
TensorFlow
TensorFlow
Keras
Keras
Git
Git
GitHub
GitHub
Linux
Linux
MySQL
MySQL
HTML
HTML
CSS
CSS
AWS
AWS
LaTeX
LaTeX

Want the full story behind a project?

Send me a message!

Contact

Don't hesitate to reach out!

Email
mail(at)janstuewe.com
Location
Munich, Germany
Links