But the team found that other areas of LLM architecture didn't benefit as much from brute force.

Don't get me wrong, Gopher has significantly more parameters than GPT-3.

DeepMind’s new 280 billion-parameter language model kicks GPT-3’s butt in accuracy

[Figure: A figure from DeepMind’s press release]