News Every Day |

Are LLMs About To Hit A Wall? | Commentary

Each new generation of large language model (LLM) consumes a staggering amount of resources. 

Meta, for instance, trained its new Llama 3 models with about 10 times more data and 100 times more compute than Llama 2. Amid a chip shortage, it used two 24,000-GPU clusters, with each chip costing roughly as much as a luxury car. It used so much data in its AI work that it considered buying the publishing house Simon & Schuster to find more.

Afterward, even its executives wondered aloud if the pace was sustainable.

“It is unclear whether we need to continue scaling or whether we need more innovation on post-training,” Ahmad Al-Dahle, Meta’s VP of GenAI, told me in an interview last week. “Is the infrastructure investment unsustainable over the long run? I don’t think we know.”

For Meta, and for its counterparts running large language models, the question of whether throwing more data, compute, and energy at the problem will lead to further improvement looms large. Since LLMs entered the popular imagination, the best path to exponential improvement has seemed to be combining these ingredients and letting the magic happen. But with the upper bound of all three potentially in sight, the industry will need newer techniques, more efficient training, and custom-built hardware to progress. Without advances in these areas, LLMs may indeed hit a wall.

The path of continued scale probably starts with better methods to train and run LLMs, some of which are already in motion. “We are starting to see new kinds of architectures that are going to change how these models scale in the future,” Swami Sivasubramanian, VP of AI and Data at Amazon Web Services, told me in an interview Thursday night. Sivasubramanian said researchers at Stanford and elsewhere are getting models to learn faster with the same amount of data, and to run inference 10 times more cheaply. “I’m actually very optimistic about the future when it comes to novel model architectures, which has the potential to disrupt the space,” he said.

Already, new methods of training these models seem to be paying off. “The smallest Llama 3 is basically as powerful as the biggest Llama 2,” Mark Zuckerberg said on the Dwarkesh Patel podcast last week.

To fuel these models, and to get around potential bottlenecks as real-world data runs out, synthetic data created by AI is playing a key role. Though not fully proven yet, this data has already made its way into model training. “Our coding abilities on Llama 3 is exceptionally high,” Meta’s Al-Dahle said. “Part of that was really being innovative and pushing on our ability to leverage models to generate synthetic data.”

Along with finding better models, LLM progress likely depends on building better chips that can train and run these models faster and more efficiently than traditional chips. While NVIDIA GPUs are exceptionally useful for large language models, they aren’t purpose-built for them. Now some chips built specifically for generative AI are showing promise. Researchers like Andrew Ng have praised Groq, one buzzy name, as the type of chip that works fast enough to take generative AI to the next level, especially as the field pushes toward agents. 

Meanwhile, companies like Amazon, Intel, Google and others are building “accelerators,” or custom chips that can run AI processes fast. At Amazon, Sivasubramanian said, the company’s purpose-built Trainium chips are “designed with the sole purpose of being able to train these large language models” and are already four times faster than the first generation.

Given the need and the opportunity ahead, it’s no wonder OpenAI CEO Sam Altman is reportedly raising a lot of money to build chips powerful enough to achieve his aims.

The one LLM constraint that’s been little discussed is energy, and it may be the most important. “There’s a capital question of, at what point does it stop being worth it to put the capital in? But I actually think before we hit that, you’re going to run into energy constraints,” Zuckerberg told Patel. He floated the idea of building a 1-gigawatt data center to advance AI, something on the scale of a meaningful nuclear power plant. But given regulatory approvals and the build-out’s complexity, it could take years to produce. “I think it will happen,” he said. “This is only a matter of time.”

Until we get to such massive energy allocation, it may be difficult to say how much room LLMs have left to improve. But it seems like sooner or later, we will find out. “I am not thinking about it myself,” Sivasubramanian said with a laugh, of a nuclear-level plant to run AI models, “but I can’t speak to my infra team.”

The post Are LLMs About To Hit A Wall? | Commentary appeared first on TheWrap.
