Daan Wierstra

Daan Wierstra is a Dutch computer scientist and the first research scientist hired at DeepMind, a co-author of the DDPG continuous-control paper, and a former co-founder of H who returned to research after the late-2024 founder departures.
Daan Wierstra

Bio

Daan Wierstra is a Dutch computer scientist and senior researcher in deep reinforcement learning, evolutionary methods for neural networks, and generative modeling. He was the first research scientist hired at DeepMind in 2011 and a senior figure across more than a decade of the company's research output, with co-authorship credits including the DDPG continuous-control paper and contributions to the variational autoencoder line. He was a co-founder of H, the Paris-headquartered agentic-AI company he established in 2023 with Charles Kantor, Laurent Sifre, Karl Tuyls, and Julien Perolat, before departing in late 2024 alongside two of the other co-founders. As of May 2026, his published record reflects more than two decades of contributions to neural-network research at IDSIA and DeepMind.

At a glance

Origins

Wierstra completed a bachelor's degree in computer science at Utrecht University in the Netherlands in 2004 before moving to the Dalle Molle Institute for Artificial Intelligence (IDSIA) in Lugano, Switzerland for doctoral work under Jürgen Schmidhuber. The IDSIA group, run by Schmidhuber alongside Faustino Gomez, Tom Schaul, and a senior cohort of researchers in evolutionary methods and recurrent-neural-network training, produced a substantial proportion of the senior deep-learning research diaspora that staffed DeepMind in its founding period.

The doctoral research program addressed evolutionary methods for neural-network optimization. The principal artifact from the IDSIA period is Natural Evolution Strategies, the family of black-box optimization algorithms that use the natural gradient to update a parameterized search distribution toward higher expected fitness. The technique was developed with Tom Schaul, Tobias Glasmachers, Yi Sun, and Schmidhuber and published in the Journal of Machine Learning Research in 2014.

Career

Wierstra was the first research scientist hired at DeepMind in 2011, joining the company's earliest research staff alongside the co-founders Demis Hassabis, Shane Legg, and Mustafa Suleyman. The DeepMind group at that point had not yet shipped the Atari deep-Q-network results that would establish the company's research profile, and the early research program built on the IDSIA-and-Schmidhuber-school recurrent-neural-network and evolutionary-methods lineage that several DeepMind early hires shared with Alex Graves and Tom Schaul.

The DeepMind tenure produced a substantial research record. The 2014 paper Stochastic Backpropagation and Approximate Inference in Deep Generative Models by Danilo Rezende, Shakir Mohamed, and Wierstra introduced one of the early formulations of variational inference for deep generative models, parallel to the Auto-Encoding Variational Bayes paper by Diederik Kingma and Max Welling at the University of Amsterdam. The two papers together formalized the variational-autoencoder family that became foundational across deep-generative-modeling research.

The 2015 paper Continuous control with deep reinforcement learning, with Timothy Lillicrap, Jonathan Hunt, Alexander Pritzel, Nicolas Heess, Tom Erez, Yuval Tassa, David Silver, and Wierstra as co-authors, introduced the Deep Deterministic Policy Gradient (DDPG) algorithm. DDPG is an actor-critic, model-free method based on the deterministic policy gradient that operates over continuous action spaces; the same paper described robust solutions across more than 20 simulated physics tasks. The DDPG line became a foundational reference for continuous-control reinforcement learning across the late 2010s.

Subsequent contributions in the DeepMind period spanned relational recurrent neural networks, imagination-augmented agents, world models, scheduling problems, embodied agents, and scalability in simulated environments. Wierstra rose to Principal Scientist during the tenure.

In 2023 Wierstra left DeepMind to co-found H in Paris with Charles Kantor, Laurent Sifre, Karl Tuyls, and Julien Perolat. The May 2024 $220 million seed round, led by Accel with Amazon, UiPath, Bpifrance, Innovation Endeavors, FirstMark, and Bernard Arnault participating, was the largest AI seed in European history at the time. The five-co-founder configuration combined Kantor's academic-and-finance background with four senior DeepMind veterans.

In late 2024, Wierstra, Tuyls, and Perolat departed H over what the company described as "operational differences." Per Sifted and other industry coverage, the three founders left H within months of each other in the second half of 2024, leaving Sifre and Kantor as the remaining co-founders. The departures were reported as a structurally significant first-year governance event for the company. Per public reporting, Wierstra returned to senior research roles after the H departure.

Affiliations

Notable contributions

  • Natural Evolution Strategies (Journal of Machine Learning Research, 2014). Co-authored with Tom Schaul, Tobias Glasmachers, Yi Sun, and Jürgen Schmidhuber, the paper introduced the family of black-box optimization algorithms using the natural gradient on parameterized search distributions.
  • Stochastic Backpropagation and Approximate Inference in Deep Generative Models (ICML 2014). Co-authored with Danilo Rezende and Shakir Mohamed, the paper introduced one of the principal early formulations of variational inference for deep generative models. The variational-autoencoder line built on this and the parallel Kingma-Welling paper became foundational across deep generative modeling.
  • Continuous control with deep reinforcement learning (DDPG, ICLR 2016). Co-authored with Lillicrap, Hunt, Pritzel, Heess, Erez, Tassa, and Silver, the paper introduced the DDPG continuous-control algorithm, which became a foundational reference for actor-critic continuous-control reinforcement learning.
  • DeepMind early-tenure research output. First research scientist at DeepMind in 2011, contributing across recurrent neural networks, world models, relational reasoning, imagination-augmented agents, and reinforcement-learning foundations.
  • H co-founding (2023). Co-founded the Paris-headquartered agentic-AI company.

Position in the field

As of May 2026, Wierstra occupies a distinctive position as a senior deep-learning researcher with co-authorship credit on three of the structurally foundational research artifacts of the contemporary deep-learning era (Natural Evolution Strategies, the variational-autoencoder line, and DDPG) and a more than decade-long DeepMind tenure that placed him at the company from its earliest research staffing. The IDSIA-school lineage through Schmidhuber, alongside fellow alumni Alex Graves and Tom Schaul, ties him to one of the principal academic origin nodes of contemporary deep learning.

The H founding tenure and subsequent departure place Wierstra inside the post-DeepMind senior-research diaspora that has staffed several European AI insurgents through the 2023 to 2026 period. Industry coverage has characterized the H departures as a structurally significant first-year governance event, with the subsequent trajectories of Wierstra, Tuyls, and Perolat as a watchable signal for the senior research-talent dynamics of the European AI ecosystem.

Outlook

Open questions over the next 6 to 18 months:

  • Future research role. Whether Wierstra returns to a senior research role at a frontier lab, an academic position, or another insurgent venture.
  • Continued publication record. Continued co-authored research output following the H departure.
  • Network with H peers. The trajectory of the broader Wierstra-Tuyls-Perolat post-departure cohort and any joint subsequent ventures.
  • DeepMind alumni network dynamics. The senior research-talent dynamics of the post-DeepMind diaspora through 2026 and 2027.

Sources

About the author
Nextomoro

Nextomoro

nextomoro tracks progress for AI research labs, models, and what's next.

AI Research Lab Intelligence

nextomoro tracks progress for AI research labs, models, and what's next.

AI Research Lab Intelligence

Great! You’ve successfully signed up.

Welcome back! You've successfully signed in.

You've successfully subscribed to AI Research Lab Intelligence.

Success! Check your email for magic link to sign-in.

Success! Your billing info has been updated.

Your billing was not updated.