Ashish Vaswani

Ashish Vaswani is an Indian computer scientist, co-founder and chief executive officer of Essential AI, and the first-listed author of the 2017 Google Brain paper that introduced the transformer architecture.
Bio

Ashish Vaswani is an Indian computer scientist and entrepreneur, born in 1986 in India. He is the co-founder and chief executive officer of Essential AI, the San Francisco-based foundation-model company he established in 2023 with Niki Parmar, and the first-listed author of the June 2017 Google Brain paper "Attention Is All You Need", which introduced the transformer architecture that underlies essentially every contemporary frontier large language model. As of May 2026, he leads Essential AI following the company's $175 million Series B at a $1 billion valuation and the release of the open-weights Rnj-1 (Ramanujan) model line.

Origins

Vaswani was born in 1986 in India and grew up in the country before pursuing his undergraduate studies. He completed a Bachelor of Technology in computer science at the Birla Institute of Technology, Mesra in 2002, then moved to the United States for graduate studies. He enrolled at the University of Southern California in 2004 for doctoral work in computer science, where he was supervised by David Chiang at the Information Sciences Institute. His doctoral research focused on natural language processing, statistical machine translation, and the application of neural networks to language modeling.

After completing his PhD, Vaswani spent approximately two years as a computer scientist in the Natural Language Group at USC's Information Sciences Institute before moving to industry research.

Career

Vaswani joined Google Brain in 2016 as a research scientist, where he worked on neural network architectures for natural language processing. The defining work of his Google Brain period is "Attention Is All You Need", submitted to arXiv in June 2017 and presented at NeurIPS 2017. The paper introduced the transformer architecture, replacing the recurrent and convolutional building blocks of prior sequence-to-sequence models with a self-attention mechanism applied across the input sequence. Vaswani is the first-listed of eight authors; the other seven are Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Łukasz Kaiser, and Illia Polosukhin.

The paper is among the most-cited in machine learning. Transformer-derived architectures form the backbone of BERT, GPT-2, GPT-3, GPT-4, Claude, Gemini, Llama, and the broader frontier-foundation-model cohort. The eight authors have collectively founded, or held senior research positions at, Character.AI, Cohere, Sakana AI, Adept AI, Inceptive, Essential AI, and other AI organizations, in a pattern of post-Google research-led founder activity that has been characterized in industry coverage as the "transformer eight" diaspora.

Vaswani left Google in 2021 to co-found Adept AI with Niki Parmar and David Luan, the former vice president of engineering at OpenAI and a former Google director. Adept positioned itself as a research-and-product company building action-taking models for enterprise software automation. Vaswani served as chief scientist. He and Parmar departed Adept in 2022 to pursue a new research-led foundation-model thesis. The senior leadership of Adept subsequently transitioned to Amazon in 2024 in a partnership-and-licensing arrangement.

In 2023, Vaswani co-founded Essential AI with Niki Parmar, headquartered in San Francisco. The company emerged from stealth in December 2023 with a $56.5 million Series A led by March Capital with strategic investors Google, Nvidia, AMD, KB Investment, Franklin Venture Partners, and Thrive Capital. Earlier seed capital of $8.3 million from Thrive Capital had funded the 2023 stealth period. The strategic-investor cohort across Google, Nvidia, and AMD was unusual in its breadth and signaled the depth of strategic interest in the founder pair's research output.

The 2024 to 2025 period at Essential AI was comparatively quiet on public product launches as the team focused on foundation-model research and full-stack enterprise-automation prototypes. The open-weights Rnj-1 (Ramanujan) base and instruction-tuned models were released in late 2025 as Essential AI's first contribution to the open-source canon. A $175 million Series B at a $1 billion post-money valuation, led by Lightspeed Venture Partners with Thrive Capital, brought the company to unicorn status and put cumulative private capital at roughly $240 million as of early 2026.

Affiliations

  • University of Southern California Information Sciences Institute: Doctoral candidate and computer scientist, 2004 to approximately 2016.
  • Google Brain: Research scientist, 2016 to 2021.
  • Adept AI: Co-founder and chief scientist, 2021 to 2022.
  • Essential AI: Co-founder and chief executive officer, 2023 to present.

Notable contributions

  • Attention Is All You Need (June 2017). First-listed author of the Google Brain transformer paper, which replaced recurrent and convolutional sequence models with self-attention as the principal mechanism for sequence-to-sequence modeling. The paper is among the most-cited in modern machine learning and the conceptual ancestor of the contemporary frontier-foundation-model architectures.
  • Tensor2Tensor (2017). Co-author of the open-source Tensor2Tensor library released by Google Brain alongside the transformer paper, which became one of the early reference implementations of the transformer architecture.
  • Adept AI co-founding (2021). Co-founded the enterprise-action-model company with Niki Parmar and David Luan.
  • Essential AI co-founding (2023). Co-founder and chief executive officer of the San Francisco foundation-model company.
  • Rnj-1 (Ramanujan) (2025). Open-weights base and instruction-tuned language models released as Essential AI's first contribution to the open-source canon.

Position in the field

Vaswani occupies a distinctive position among contemporary AI lab founders. The combination of first-listed authorship of the transformer paper and the founder-and-chief-executive role at a unicorn-tier foundation-model company is unusual within the broader frontier-and-insurgent founder cohort. Industry coverage has consistently characterized Essential AI as one of the labs to watch among the post-frontier insurgents of the 2023 cohort, with the Vaswani-and-Parmar transformer credibility providing recruiting leverage among research engineers and senior researchers.

The "transformer eight" pattern of post-Google research-led founder activity has produced a distinctive cohort of insurgent AI labs and senior research positions, including Noam Shazeer at Character.AI (subsequently rejoining Google in 2024), Aidan N. Gomez at Cohere, Llion Jones at Sakana AI, Jakob Uszkoreit at Inceptive, Niki Parmar at Anthropic, and Vaswani at Essential AI. Among that cohort, Vaswani's first-listed authorship and continuing chief-executive role at a foundation-model lab give him the most direct claim on the architecture's continuing development.

The September 2025 Bloomberg Businessweek profile of Vaswani characterized his strategic premise as a critique of the dominant transformer-scaling thesis, with Essential AI positioned as a research-led counterweight to the closed-frontier labs. The premise places Vaswani in a distinctive editorial position: a senior figure of the transformer era making the public case for research and architectural innovation beyond pure parameter-and-compute scaling.

Outlook

Open questions and watchable signals over the next 6 to 18 months:

  • Essential AI product surface. The pace and form of the in-development full-stack enterprise-automation product line, and the conversion from foundation-model research credibility to enterprise-automation revenue at scale.
  • Continued Rnj-family releases. Whether the Rnj-1 (Ramanujan) line is followed by larger or differently positioned open-weights models, and the strategic posture toward continued open-weights distribution alongside potential closed-weights commercial models.
  • Series C and adjacent fundraising. The next pricing event beyond the early-2026 Series B unicorn round.
  • Strategic-investor relationships. Whether the Google-Nvidia-AMD strategic-investor cohort converts into meaningful distribution channels or remains primarily a capital-and-compute relationship.
  • Senior research recruiting. Continued senior research-and-engineering hiring against the post-frontier insurgent cohort, including Mistral AI, Reka AI, and the broader insurgent-foundation-model peer set.

About the author
Nextomoro

nextomoro tracks progress for AI research labs, models, and what's next.