Noam Shazeer
Last updated Jun 24, 2026 by ATDb automated enrichment · Connections updated Jun 29, 2026
- Years in industry: 22 years
Bio
Noam Shazeer is one of the most consequential AI researchers of the modern era, best known as a co-author of the landmark 2017 paper 'Attention Is All You Need,' which introduced the Transformer architecture. The Transformer became the building block for GPT, BERT, PaLM, and nearly every large language model behind today's AI-driven advertising, content generation, and personalization systems. His work has had an outsized, if often uncredited, influence on AdTech through the AI infrastructure that now drives ad targeting, creative generation, and audience modeling at scale.

Shazeer spent the bulk of his career at Google, which he joined in 2000, contributing to core machine learning and search infrastructure. He was a key figure in Google Brain and co-developed the sparsely gated Mixture of Experts (MoE) layer, which routes each input to only a few expert sub-networks so that model capacity can grow without a proportional increase in compute per input. That property matters for latency-sensitive systems such as real-time ad serving and bidding. He left Google in 2021 to co-found Character.AI, a conversational AI company, with Daniel De Freitas. In 2024, Google signed a licensing agreement with Character.AI, reportedly valued at approximately $2.7 billion, under which Shazeer and De Freitas returned to Google. Character.AI itself was not acquired and continues to operate as a separate company.

While Shazeer is not a traditional AdTech practitioner, his technical contributions are deeply embedded in the infrastructure of modern digital advertising. The Transformer models he helped create power semantic search, ad relevance ranking, creative optimization, and conversational ad interfaces. His influence on AdTech is foundational rather than operational: much of the industry's AI stack builds on the architecture he helped invent.
Career
Co-founder & CEO
Character.AI · 2021-2024
Senior Research Scientist
Google Brain / Google · 2000-2021
Expertise & education
Expertise
Education
- B.S. Computer Science, Duke University
Speaking topics
Recognition
Notable achievements
- Co-authored 'Attention Is All You Need' (2017), introducing the Transformer architecture that now underlies nearly all major LLMs
- Co-founded Character.AI, whose technology Google licensed in 2024 in a deal reportedly valued at approximately $2.7 billion
- Co-developed the sparsely gated Mixture of Experts (MoE) layer, enabling efficient deployment of large-scale models
- Key contributor to Google's core machine learning and search ranking infrastructure over two decades
Awards
Publications
- Vaswani, A., Shazeer, N., et al. — 'Attention Is All You Need' (NeurIPS 2017)
- Shazeer, N., et al. — 'Outrageously Large Neural Networks: The Sparsely-Gated Mixture-of-Experts Layer' (ICLR 2017)
- Shazeer, N. — 'GLU Variants Improve Transformer' (2020)