OpenAI Promises the Next Model of ChatGPT Will Be Better at Reasoning
OpenAI has unveiled a new model for its products, expected to reach users near the end of January 2025. It's called o3 (the company appears to have skipped over o2), and it promises another significant step forward in AI reasoning. According to its developers, it will make tools like ChatGPT better than ever at programming and solving math problems.
OpenAI CEO Sam Altman described o3 as "incredibly smart" in the video announcing the model, released as part of his company's "12 Days of OpenAI" promotion over the holiday season. The model is undergoing a variety of safety tests before it launches in full—first likely only for paying ChatGPT Plus users.
The o3 model is more than 20 percent better than the previous o1 model at coding, per the SWE-bench Verified benchmark, OpenAI says. It also scores strongly on math and science problems, at least according to benchmark tests—like o1, the o3 model is trained to think and reason before it answers, rigorously testing its responses for accuracy. OpenAI will also release a smaller, faster o3-mini model alongside the main update.
We won't know just how good o3 is until users can test it for themselves, but we already have some idea of its capabilities: it has been run against the well-known Abstraction and Reasoning Corpus (ARC) challenge, which is designed to track AI's progress toward Artificial General Intelligence (AGI), the somewhat contentious point at which an AI's cognitive capabilities surpass those of humans.
The challenge is designed to push AI to devise new approaches to problems rather than rely on memorized patterns, and it consists of a series of visual tasks: models must match patterns in colored grids, exercises intended to be easy for people to complete without any training, but hard for AI to figure out.
Within the computing power limits of the ARC test, o3 scored 75.7 percent. That's far above the 5 percent achieved by GPT-4o, currently the best ChatGPT model available to free users. While we're still some way short of AGI (o3 remains below human scores, and couldn't complete every task), it's an impressive step up.
"OpenAI's new o3 model represents a significant leap forward in AI's ability to adapt to novel tasks," writes François Chollet, the software engineer who designed the ARC test. "This is not merely incremental improvement, but a genuine breakthrough, marking a qualitative shift in AI capabilities compared to the prior limitations of LLMs."
Predictably, OpenAI didn't talk about the energy demands of AI, the ethics of training AI on publicly available data that may be copyrighted, or the tendency for these models to hallucinate wrong answers—while mistakes should be fewer because of o3's extra thinking time, they won't be eradicated. What the company did mention is an expansion of its safety testing program, designed to prevent these models from being used for malicious purposes.
The ability for AI models to truly "think" or "reason"—or at least attempt some approximation of those human capabilities—will no doubt continue to be discussed as AI development progresses. Google has also just unveiled its Gemini 2.0 model, which brings with it improved reasoning.