Noam Shazeer

Thought Leader

Last updated Jun 24, 2026 by ATDb automated enrichment · Connections updated Jun 29, 2026

Role
Research Scientist / Co-founder
Company
Google (returned in 2024 via the Character.AI licensing deal)
Based
San Francisco, California, United States
Connections
1
Years in industry
22 years

Bio

Noam Shazeer is one of the most consequential AI researchers of the modern era, best known as a co-author of the landmark 2017 paper 'Attention Is All You Need,' which introduced the Transformer architecture. That architecture became the foundation for GPT, BERT, PaLM, and nearly every large language model behind today's AI-driven advertising, content generation, and personalization systems. His work has had an outsized, if often uncredited, influence on AdTech through the AI infrastructure that now drives ad targeting, creative generation, and audience modeling at scale.

Shazeer spent the bulk of his career at Google, joining in 2000 and contributing to core machine learning and search infrastructure. He was a key figure in Google Brain and contributed to Mixture of Experts (MoE) architectures, which activate only a subset of a model's parameters for each input and so reduce the compute cost of very large models, a critical concern for real-time ad serving and bidding systems. He left Google in 2021 to co-found Character.AI, a conversational AI company, alongside Daniel De Freitas. In 2024, Google signed a licensing agreement with Character.AI reportedly valued at approximately $2.7 billion, and Shazeer returned to Google as part of that deal; Character.AI continued to operate as an independent company.

While Shazeer is not a traditional AdTech practitioner, his technical contributions are deeply embedded in the infrastructure of modern digital advertising. Transformer-based models power semantic search, ad relevance ranking, creative optimization, and conversational ad interfaces. His influence on AdTech is foundational rather than operational: much of the industry's AI tooling runs on the architecture he helped invent.

Career

  • Co-founder & CEO

    Character.AI · 2021-2024

  • Senior Research Scientist

    Google Brain / Google · 2000-2021

Expertise & education

Expertise

  • Large Language Models (LLMs)
  • Transformer Architecture
  • Mixture of Experts (MoE)
  • Machine Learning Infrastructure
  • AI-Driven Personalization
  • Natural Language Processing
  • Conversational AI
  • Ad Relevance and Ranking Systems

Education

  • B.S. Computer Science, Duke University

Speaking topics

  • Large Language Models and their applications
  • Transformer architecture and attention mechanisms
  • Efficient scaling of AI models
  • Conversational AI and human-computer interaction

Recognition

Notable achievements

  • Co-authored 'Attention Is All You Need' (2017), introducing the Transformer architecture that underlies nearly all major LLMs
  • Co-founded Character.AI, whose technology Google licensed in 2024 in a deal reportedly valued at approximately $2.7 billion
  • Contributed to Mixture of Experts (MoE) research enabling efficient large-scale model deployment
  • Key contributor to Google's core machine learning and search ranking infrastructure over two decades

Awards

Test of Time Award — 'Attention Is All You Need' paper widely recognized as one of the most cited and impactful ML papers in history

Publications

  • Vaswani, A., Shazeer, N., et al. — 'Attention Is All You Need' (NeurIPS 2017)
  • Shazeer, N., et al. — 'Outrageously Large Neural Networks: The Sparsely-Gated Mixture-of-Experts Layer' (ICLR 2017)
  • Shazeer, N. — 'GLU Variants Improve Transformer' (2020)