About

Hi, I’m Thomas! I’m currently an M2 mathematics student at Télécom SudParis and ENS Paris-Saclay, and soon, an AI researcher.

I began doing research in AI security two years ago. After taking a gap year, where I studied Large Language Model (LLM) security and LLM interpretability, I’m now turning my attention toward more fundamental AI. I also love graph theory and physics.

My favorite paper: Toy Models of Superposition.

You can find me on:

GitHub / Sckathach
LinkedIN / thomas-winninger
X / sckathach

You can contact me at: thomas [dot] winninger [at] telecom-sudparis [dot] eu

Download PDF resume: in English.

See my Research work.

Education

2025 - 2026 - Master MVA, ENS Paris-Saclay
Topology, optimal transport, reinforcement learning, training and deploying large-scale models, LLM, graph neural networks, learning for protein science, convex optimization.
2022 - 2026 - Engineering Degree, Télécom SudParis
Telecommunications, cyber security, cloud, information theory, probability, optimization, graph theory, graph neural networks, signal processing.

Experience

Jul - Sep 2025 - Research internship in LLM security - NICT
Research on the security and jailbreak interpretability of Large Reasoning Models (LRMs). I studied LRM robustness, adapted state-of-the-art black-box and white-box attack from LLMs, and started studying jailbreaks with interpretability methods on LRMs.
Mar - May 2025 - Research internship in AI explanability - INRIA
Verified robust explanation for language models. I explored scaling Hybrid Constrained Zonotopes (HCZs) to language models using convex relaxation and optimization. However, the relaxation error proved too large for practical use.
Jul - Dec 2024 - Research internship in AI security - Thales
Implementations and improvements of state-of-the-art attacks on LLMs. I improved state-of-the-art white-box adversarial attacks on LLMs and published the results on ArXiv.
2022 - 2024 - Teaching and infrastructure - HackademINT
Teaching (cloud and AI security), cloud management (Kubernetes), creation of challenges (AI & quantum physics), and organization of 404CTF 2023 & 2024 (largest cyber security competition in France).

Miscellaneous

Languages: Python, French, OCaml, English, Typst, TypeScript, Lua, Rust, C, Bash, Japanese (JLPT 4), Lean.
Tools/ Frameworks: PyTorch, nnsight, Docker (Podman), Kubernetes, React, Qiskit, Archlinux.
Other interests: Piano, guitar, teaching, reading, geopolitics, particle physics, sports, video game (playing & development), meditation.
I completed the Alignment Research Engineer Accelerator (ARENA) and the AI Safety Fundamental (AISF) curriculums.