News Every Day | Yesterday, 20:18

OpenAI recently unveiled its latest artificial intelligence (AI) models, o1-preview and o1-mini (also referred to as “Strawberry”), claiming a significant leap in the reasoning capabilities of large language models (the technology behind Strawberry and OpenAI’s ChatGPT). While the release of Strawberry generated excitement, it also raised critical questions about its novelty, efficacy and potential risks.

Central to this is the model’s ability to employ “chain-of-thought reasoning” – a method similar to a human using a scratchpad, or notepad, to write down intermediate steps when solving a problem.

Chain-of-thought reasoning mirrors human problem solving by breaking down complex tasks into simpler, manageable sub-tasks. The use of scratchpad-like reasoning in large language models is not a new idea.

The ability to perform chain-of-thought reasoning by AI systems not specifically trained to do so was first observed in 2022 by several research groups. These included Jason Wei and colleagues from Google Research and Takeshi Kojima and colleagues from the University of Tokyo and Google.

Before these works, other researchers such as Oana Camburu from the University of Oxford and her colleagues investigated the idea of teaching models to generate text-based explanations for their outputs. This is where the model describes the reasoning steps that it went through in order to produce a particular prediction.

Even earlier than this, researchers including Jacob Andreas from the Massachusetts Institute of Technology explored the idea of language as a tool for deconstructing complex problems. This enabled models to break down complex tasks into sequential, interpretable steps. This approach aligns with the principles of chain-of-thought reasoning.

Strawberry’s potential contribution to the field of AI could lie in scaling up these concepts.

A closer look

Although the exact method used by OpenAI for Strawberry is shrouded in mystery, many experts think that it uses a procedure known as “self-verification”.

This procedure improves the AI system’s own ability to perform chain-of-thought reasoning. Self-verification is inspired by how humans reflect and play out scenarios in their minds to make their reasoning and beliefs consistent.

Most recent AI systems based on large language models, such as Strawberry, are built in two stages. They first go through a process called “pre-training”, where the system acquires its basic knowledge by running through a large general dataset of information.

Chain-of-thought reasoning has similarities with the way people write down intermediate steps on a notepad when solving a problem. Earth Phakphum/Shutterstock

They can then undergo fine-tuning, where they are taught to perform specific tasks better, typically by being provided with additional, more specialised data.

This additional data is often curated and “annotated” by humans. This is where a person provides the AI system with additional context to aid its understanding of the training data. However, Strawberry’s self-verification approach is thought by some to be less data-hungry. Yet, there are indications that some of the o1 AI models were trained on extensive examples of chain-of-thought reasoning that have been annotated by experts.

This raises questions about the extent to which self-improvement, rather than expert-guided training, contributes to its capabilities. In addition, while the model may excel in certain areas, its reasoning proficiency does not surpass basic human competence in others. For example, versions of Strawberry still struggle with some mathematical reasoning problems that a capable 12-year-old can solve.

Risks and opacity

One primary concern with Strawberry is the lack of transparency surrounding the self-verification process and how it works. The reflection that the model performs upon its reasoning is not available to be examined, depriving users of insights into the system’s functioning.

The “knowledge” relied upon by the AI system to answer a given query is not available for inspection either. This means there is no way to edit or specify the set of facts, assumptions, and deduction techniques to be used.

Consequently, the system may produce answers that appear to be correct, and reasoning that appears sound, when in fact they are fundamentally flawed, potentially leading to misinformation.

Finally, OpenAI has built in protections to prevent undesirable uses of o1. But a recent report by OpenAI, that evaluates the system’s performance, did uncover some risks. Some researchers we have spoken to have shared their concerns, particularly regarding the potential for misuse by cyber-criminals.

The model’s ability to intentionally mislead or produce deceptive outputs – outlined in the report – adds another layer of risk, emphasising the need for stringent safeguards.

The authors do not work for, consult, own shares in or receive funding from any company or organisation that would benefit from this article, and have disclosed no relevant affiliations beyond their academic appointment.

News Every Day

Las Vegas GP F1 qualifying: George Russell takes pole, Lewis Hamilton only 10th

Today24.pro

F1 Las Vegas Grand Prix – Start time, starting grid, how to watch, & more

Sky Sports commentator stunned by ‘one of the strangest reactions to a goal I’ve ever seen’ by Watford fans

Exclusive: Sumit Kaul on joining the new season of Tenali Rama as Girgit; says ‘It will be a challenge for me to live up to the expectations of audience’

Michail Antonio reveals he was barred from entering the UK after passport blunder in nightmare international break

Ria.city

Read also

Yesterday, 23:30

How to watch the 2024 PFL Championship: Who's fighting, lineup, start time, preview videos, more

Yesterday, 22:00

‘I’m already crashing out’: Victoria’s Secret worker exposes the truth about the back room during the holidays

Yesterday, 20:13

Jaden Ivey: Cade Cunningham and I have the talent to be one of the best backcourts ever

Moscow.media

News, articles, comments, with a minute-by-minute update, now on Today24.pro

News Every Day

Exclusive: Sumit Kaul on joining the new season of Tenali Rama as Girgit; says ‘It will be a challenge for me to live up to the expectations of audience’

Today24.pro — latest news 24/7. You can add your news instantly now — here

News Every Day

Michail Antonio reveals he was barred from entering the UK after passport blunder in nightmare international break

Sports today

Новости тенниса

WTA

«Немного недотягиваю до Соболенко». 19-летняя россиянка сравнила себя с теннисистками WTA

News.tennis

Миранчук с «Атлантой» выбыл из плей-офф МЛС, Синнер выиграл Кубок Дэвиса. Главное к утру Кубок Дэвиса — 2024: церемония прощания Рафаэля Надаля с теннисом вызвала критику, почему не приехал Новак Джокович Зарина Дияс узнала хорошую новость от WTA Синнер: Защита титула — одно из лучших чувств

Спорт в России и мире

Москва

В Москве судья сделала замечание перекрестившемуся мальчику-спортсмену: русские требуют поставить её на место

All sports news today

Sports in Russia today

Москва

В Москве судья сделала замечание перекрестившемуся мальчику-спортсмену: русские требуют поставить её на место

Новости России

Game News

The community behind the PC port of Ocarina of Time have been secretly working on a native version of Star Fox 64

Russian.city

Москва

Митрополит Матфей поздравил духовника обители милосердия в Новоржевском округе Иоанна Миронова с 98-летием

Губернаторы России

Зенит

"Зенит" обыграл "Динамо" в Москве: счет 3-1 в пользу гостей

News-life

Полюс Папанина. Как полярники с дрейфующей станции стали народными кумирами

«Грузовичкоф» на передовой новых коллабораций с блогерами: выступление Наталии Поникаровской на конференции The Trends

Филиал № 4 ОСФР по Москве и Московской области информирует: В Москве и Московской области 650 тысяч пенсионеров старше 80 лет получают пенсию в повышенном размере

КАК ВСЕМ ПРИБОРАМ ДЕЛАЮТ РЕЗУЛЬТАТЫ ИЗМЕРЕНИЙ? ПОТОМУ ЧТО ВСЕ ЦИВИЛИЗАЦИИ СИСТЕМАТИЗИРОВАНЫ ПРОГРАММНОЙ РАБОТОЙ. Россия, США, Европа могут улучшить отношения и здоровье общества?!

Poisk-music.ru

«Микробиотики микст» с антоцианами удостоены золотой медали на Международном Конкурсе качества

Shot: квартиру Ротару в Москве за 65 млн рублей сняли с продажи

Менеджер Песни. Менеджер Релиза Песни.

«Он считает себя идеальным». Оксана Самойлова обвинила Джигана в нарциссизме в новом выпуске «Большого переселения» на ТНТ

News.tennis

Кубок Дэвиса. 1/2 финала. Ван де Зандшульп сыграет с Альтмайером, Грикспор встретится со Штруффом

Кубок Дэвиса — 2024: церемония прощания Рафаэля Надаля с теннисом вызвала критику, почему не приехал Новак Джокович

Миранчук с «Атлантой» выбыл из плей-офф МЛС, Синнер выиграл Кубок Дэвиса. Главное к утру

Теннисисты из Италии второй раз подряд выиграли Кубок Дэвиса

Russian.city

С глаз долой, из сердца - вон: что делают россияне с подарками бывших

Филиал № 4 ОСФР по Москве и Московской области информирует: В Москве и Московской области 650 тысяч пенсионеров старше 80 лет получают пенсию в повышенном размере

«Грузовичкоф» на передовой новых коллабораций с блогерами: выступление Наталии Поникаровской на конференции The Trends

Bloody - участник и технический партнер Red Expo-2024

Bigpot.news

Певец Кристовский посетил пункт отбора граждан на военную службу в Москве

Рэпер Guf анонсировал прощальный тур в 2025 году

Посол Ирана призвал наказать полицейских, задержавших иранских студентов в Казани

29ru.net

ГАБТ: около 1,5 тысячи билетов на "Щелкунчик" отданы участникам СВО

Томичи стали чемпионами мира по универсальному бою

Митрополит Матфей поздравил духовника обители милосердия в Новоржевском округе Иоанна Миронова с 98-летием

Нижегородский ХК «Торпедо» разгромил «Спартак»

Путин в России и мире

Russia24.pro

С глаз долой, из сердца - вон: что делают россияне с подарками бывших Филиал № 4 ОСФР по Москве и Московской области информирует: В Москве и Московской области 650 тысяч пенсионеров старше 80 лет получают пенсию в повышенном размере С глаз долой, из сердца - вон: что делают россияне с подарками бывших Филиал № 4 ОСФР по Москве и Московской области информирует: Отделение СФР по Москве и Московской области оплатило свыше 243 тысяч дополнительных выходных дней по уходу за детьми с инвалидностью

Life24.pro

«Футбол-шоу» признано лучшей детской программой по версии Премии имени Эдуарда Сагалаева Итоги по завершению большого проекта «Дизайн-Перспектива 2024» «Микробиотики микст» с антоцианами удостоены золотой медали на Международном Конкурсе качества Волейболисты «Динамо» (Москва) в Marins Park Hotel Нижний Новгород

Агрегатор новостей 24СМИ

123ru.net

Подмосковные спортсмены завоевали 4 медали на чемпионате России по плаванию Захарова заявила о попытках Германии переписать историю в пользу Третьего рейха Многодетную семью из Домодедова наградили орденом «Родительская слава» Нижегородский ХК «Торпедо» разгромил «Спартак»

Персональные новости

Today24.pro

Sky Sports commentator stunned by ‘one of the strangest reactions to a goal I’ve ever seen’ by Watford fans F1 Las Vegas Grand Prix – Start time, starting grid, how to watch, & more Michail Antonio reveals he was barred from entering the UK after passport blunder in nightmare international break Exclusive: Sumit Kaul on joining the new season of Tenali Rama as Girgit; says ‘It will be a challenge for me to live up to the expectations of audience’

Russian.city

Linkin Park

Новый альбом Linkin Park занял вторую строчку чарта Billboard 200

Агрегатор новостей 24СМИ

News Every Day

Michail Antonio reveals he was barred from entering the UK after passport blunder in nightmare international break

Today24.pro

Sky Sports commentator stunned by ‘one of the strangest reactions to a goal I’ve ever seen’ by Watford fans Exclusive: Sumit Kaul on joining the new season of Tenali Rama as Girgit; says ‘It will be a challenge for me to live up to the expectations of audience’ F1 Las Vegas Grand Prix – Start time, starting grid, how to watch, & more Las Vegas GP F1 qualifying: George Russell takes pole, Lewis Hamilton only 10th

123ru.net

«Россияне готовы говорить, просто их никто не спрашивает. Они никого не интересуют» — блогер Витя Кравченко Reuters: «Газпром» прекратил поставки OMV после отбора ею газа без оплаты Митрополит Матфей поздравил духовника обители милосердия в Новоржевском округе Иоанна Миронова с 98-летием Москва и Стамбул названы самыми популярными направлениями для отдыха осенью

Friends of Today24

Музыкальные новости

Агрегатор новостей 24СМИ

Персональные новости

A closer look

Risks and opacity

Las Vegas GP F1 qualifying: George Russell takes pole, Lewis Hamilton only 10th

F1 Las Vegas Grand Prix – Start time, starting grid, how to watch, & more

Sky Sports commentator stunned by ‘one of the strangest reactions to a goal I’ve ever seen’ by Watford fans

Exclusive: Sumit Kaul on joining the new season of Tenali Rama as Girgit; says ‘It will be a challenge for me to live up to the expectations of audience’

Michail Antonio reveals he was barred from entering the UK after passport blunder in nightmare international break

Read also

How to watch the 2024 PFL Championship: Who's fighting, lineup, start time, preview videos, more

‘I’m already crashing out’: Victoria’s Secret worker exposes the truth about the back room during the holidays

Jaden Ivey: Cade Cunningham and I have the talent to be one of the best backcourts ever

Exclusive: Sumit Kaul on joining the new season of Tenali Rama as Girgit; says ‘It will be a challenge for me to live up to the expectations of audience’

Michail Antonio reveals he was barred from entering the UK after passport blunder in nightmare international break

Sports today

«Немного недотягиваю до Соболенко». 19-летняя россиянка сравнила себя с теннисистками WTA

В Москве судья сделала замечание перекрестившемуся мальчику-спортсмену: русские требуют поставить её на место

All sports news today

Sports in Russia today

В Москве судья сделала замечание перекрестившемуся мальчику-спортсмену: русские требуют поставить её на место

The community behind the PC port of Ocarina of Time have been secretly working on a native version of Star Fox 64

Митрополит Матфей поздравил духовника обители милосердия в Новоржевском округе Иоанна Миронова с 98-летием

"Зенит" обыграл "Динамо" в Москве: счет 3-1 в пользу гостей

Полюс Папанина. Как полярники с дрейфующей станции стали народными кумирами

«Грузовичкоф» на передовой новых коллабораций с блогерами: выступление Наталии Поникаровской на конференции The Trends

Филиал № 4 ОСФР по Москве и Московской области информирует: В Москве и Московской области 650 тысяч пенсионеров старше 80 лет получают пенсию в повышенном размере

«Микробиотики микст» с антоцианами удостоены золотой медали на Международном Конкурсе качества

Shot: квартиру Ротару в Москве за 65 млн рублей сняли с продажи

Менеджер Песни. Менеджер Релиза Песни.

«Он считает себя идеальным». Оксана Самойлова обвинила Джигана в нарциссизме в новом выпуске «Большого переселения» на ТНТ

Кубок Дэвиса. 1/2 финала. Ван де Зандшульп сыграет с Альтмайером, Грикспор встретится со Штруффом

Кубок Дэвиса — 2024: церемония прощания Рафаэля Надаля с теннисом вызвала критику, почему не приехал Новак Джокович

Миранчук с «Атлантой» выбыл из плей-офф МЛС, Синнер выиграл Кубок Дэвиса. Главное к утру

Теннисисты из Италии второй раз подряд выиграли Кубок Дэвиса

С глаз долой, из сердца - вон: что делают россияне с подарками бывших

Филиал № 4 ОСФР по Москве и Московской области информирует: В Москве и Московской области 650 тысяч пенсионеров старше 80 лет получают пенсию в повышенном размере

«Грузовичкоф» на передовой новых коллабораций с блогерами: выступление Наталии Поникаровской на конференции The Trends

Bloody - участник и технический партнер Red Expo-2024

Певец Кристовский посетил пункт отбора граждан на военную службу в Москве

Последние новости digital-сферы и финансов Казахстана

Рэпер Guf анонсировал прощальный тур в 2025 году

Посол Ирана призвал наказать полицейских, задержавших иранских студентов в Казани

ГАБТ: около 1,5 тысячи билетов на "Щелкунчик" отданы участникам СВО

Томичи стали чемпионами мира по универсальному бою

Митрополит Матфей поздравил духовника обители милосердия в Новоржевском округе Иоанна Миронова с 98-летием

Нижегородский ХК «Торпедо» разгромил «Спартак»

Новый альбом Linkin Park занял вторую строчку чарта Billboard 200

Michail Antonio reveals he was barred from entering the UK after passport blunder in nightmare international break

Friends of Today24