There’s a race toward language models with longer context windows. But how good are they, and how can we know?
This article was originally published on Art Fish Intelligence.
The context window of large language models (the amount of text they can process at once) has been growing at an exponential rate.
In 2018, language models like BERT, T5, and GPT-1 could take up to 512 tokens as input. Now, in the summer of 2024, this number has jumped to 2 million tokens (in publicly available LLMs). But what does this mean for us, and how do we evaluate these increasingly capable models?
The recently released Gemini 1.5 Pro model can take in up to 2 million tokens. But what does 2 million tokens even mean?
If we estimate that 3 words roughly equal about 4 tokens, it means that 2 million tokens can (almost) fit the entire Harry Potter and Lord of the Rings series.
(The total word count of all seven books in the Harry Potter series is 1,084,625. The total word count of the Lord of the Rings series is 481,103. 1,084,625 + 481,103 = 1,565,728 words in total.)
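For a quick back-of-envelope check, here is a minimal sketch of that arithmetic in Python. The word counts are the figures cited above, and the 3-words-to-4-tokens ratio is only the rough heuristic from earlier, not an exact tokenizer count:

```python
# Back-of-envelope estimate: how many tokens would both series need?
# Word counts are the figures cited above; the 3-words-to-4-tokens ratio
# is only a rough heuristic, not the output of a real tokenizer.
HARRY_POTTER_WORDS = 1_084_625   # all seven Harry Potter books
LOTR_WORDS = 481_103             # the Lord of the Rings series
TOKENS_PER_WORD = 4 / 3          # ~4 tokens for every 3 words

total_words = HARRY_POTTER_WORDS + LOTR_WORDS
estimated_tokens = total_words * TOKENS_PER_WORD

print(f"Total words: {total_words:,}")                # 1,565,728
print(f"Estimated tokens: {estimated_tokens:,.0f}")   # ~2,087,637
```

Under this rough estimate, the two series come out just over the 2 million token mark, which is why they only (almost) fit in the context window.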