News Every Day | 14:02

OpenAI’s Strawberry program is reportedly capable of reasoning. It might be able to deceive humans

OpenAI, the company that made ChatGPT, has launched a new artificial intelligence (AI) system called Strawberry. It is designed not just to provide quick responses to questions, like ChatGPT, but to think or “reason”.

This raises several major concerns. If Strawberry really is capable of some form of reasoning, could this AI system cheat and deceive humans?

OpenAI can program the AI in ways that mitigate its ability to manipulate humans. But the company’s own evaluations rate it as a “medium risk” for its ability to assist experts in the “operational planning of reproducing a known biological threat” – in other words, a biological weapon. It was also rated as a medium risk for its ability to persuade humans to change their thinking.

It remains to be seen how such a system might be used by those with bad intentions, such as con artists or hackers. Nevertheless, OpenAI’s evaluation states that medium-risk systems can be released for wider use – a position I believe is misguided.

Strawberry is not one AI “model”, or program, but several – known collectively as o1. These models are intended to answer complex questions and solve intricate maths problems. They are also capable of writing computer code – to help you make your own website or app, for example.

An apparent ability to reason might come as a surprise to some, since this is generally considered a precursor to judgment and decision making – something that has often seemed a distant goal for AI. So, on the surface at least, it would seem to move artificial intelligence a step closer to human-like intelligence.

When things look too good to be true, there’s often a catch. Well, this set of new AI models is designed to maximise their goals. What does this mean in practice? To achieve its desired objective, the path or the strategy chosen by AI may not always necessarily be fair, or align with human values.

True intentions

For example, if you were to play chess against Strawberry, in theory, could its reasoning allow it to hack the scoring system rather than figure out the best strategies for winning the game?

The AI might also be able to lie to humans about its true intentions and capabilities, which would pose a serious safety concern if it were to be deployed widely. For example, if the AI knew it was infected with malware, could it “choose” to conceal this fact in the knowledge that a human operator might opt to disable the whole system if they knew?

Strawberry goes a step beyond the capabilities of AI chatbots. Robert Way / Shutterstock

These would be classic examples of unethical AI behaviour, where cheating or deceiving is acceptable if it leads to a desired goal. It would also be quicker for the AI, as it wouldn’t have to waste any time figuring out the next best move. It may not necessarily be morally correct, however.

This leads to a rather interesting yet worrying discussion. What level of reasoning is Strawberry capable of and what could its unintended consequences be? A powerful AI system that’s capable of cheating humans could pose serious ethical, legal and financial risks to us.

Such risks become grave in critical situations, such as designing weapons of mass destruction. OpenAI rates its own Strawberry models as “medium risk” for their potential to assist scientists in developing chemical, biological, radiological and nuclear weapons.

OpenAI says: “Our evaluations found that o1-preview and o1-mini can help experts with the operational planning of reproducing a known biological threat.” But it goes on to say that experts already have significant expertise in these areas, so the risk would be limited in practice. It adds: “The models do not enable non-experts to create biological threats, because creating such a threat requires hands-on laboratory skills that the models cannot replace.”

Powers of persuasion

OpenAI’s evaluation of Strawberry also investigated the risk that it could persuade humans to change their beliefs. The new o1 models were found to be more persuasive and more manipulative than ChatGPT.

OpenAI also tested a mitigation system that was able to reduce the manipulative capabilities of the AI system. Overall, Strawberry was labelled a medium risk for “persuasion” in Open AI’s tests.

Strawberry was rated low risk for its ability to operate autonomously and on cybersecurity.

Open AI’s policy states that “medium risk” models can be released for wide use. In my view, this underestimates the threat. The deployment of such models could be catastrophic, especially if bad actors manipulate the technology for their own pursuits.

This calls for strong checks and balances that will only be possible through AI regulation and legal frameworks, such as penalising incorrect risk assessments and the misuse of AI.

The UK government stressed the need for “safety, security and robustness” in their 2023 AI white paper, but that’s not nearly enough. There is an urgent need to prioritise human safety and devise rigid scrutiny protocols for AI models such as Strawberry.

Shweta Singh does not work for, consult, own shares in or receive funding from any company or organisation that would benefit from this article, and has disclosed no relevant affiliations beyond their academic appointment.

Москва

Спецконтейнеры для сбора сжатых пластиковых бутылок выпустили в Подмосковье

Today24.pro

Inexperienced Secret service agent called tech support hotline for help piloting drone ahead of Trump rally shooting: bombshell report

Elle King shares major life update after opening up about 'toxic' relationship with dad Rob Schneider

Eddie Hearn threatens to ‘knock out’ rival promoter in bizarre confrontation on stage at Joshua vs Dubois face-offs

Mum leaves people raging over VERY unique baby moniker, as they remind her she’s ‘naming kids, not Hungry Hippos’

Ria.city

Read also

4 hours ago

Iconic fizzy drink brand to be ‘retired’ leaving fans fearing it will be discontinued

8 hours ago

‘What a twist!’ fume Great British Bake Off fans as they slam major format change

6 hours ago

Pope Francis, back from flu, calls airstrikes on Lebanon ‘unacceptable’

Moscow.media

News, articles, comments, with a minute-by-minute update, now on Today24.pro

News Every Day

Eddie Hearn threatens to ‘knock out’ rival promoter in bizarre confrontation on stage at Joshua vs Dubois face-offs

Today24.pro — latest news 24/7. You can add your news instantly now — here

News Every Day

Eddie Hearn threatens to ‘knock out’ rival promoter in bizarre confrontation on stage at Joshua vs Dubois face-offs

Sports today

Новости тенниса

Кубок Лейвера

Медведев в составе сборной Европы завоевал Кубок Лейвера

News.tennis

Теннисист Надаль вошел в состав сборной Испании на Кубок Дэвиса Рейтинг WTA. Эрика Андреева обновила личный рекорд, Саккари выпала из топ-15, Шрамкова поднялась на 41 строчку Токио (ATP). 1-й круг. Хуркач сыграет с Гироном, Берреттини – с ван де Зандшульпом Александр Зверев снялся с турнира ATP-500 в Пекине

Спорт в России и мире

Москва

На матче "ЦСКА-Динамо" родилась новая семья

All sports news today

Sports in Russia today

Москва

Кадеты Пермского президентского кадетского училища стали участниками первого этапа Всероссийской олимпиады школьников

Новости России

Game News

Мультиплеерный данжен-кроулер Greedy Wizards: Speed Dungeon вышел в новой стране на iOS и Android

Russian.city

Москва

На матче "ЦСКА-Динамо" родилась новая семья

Губернаторы России

БГАТОиБ

В Бурятском театре оперы и балета появилась бонусная программа

News-life

Агния Кузнецова в шоу «Вкусно с Анфисой Чеховой» рассказала, как убедила Балабанова взять на роль её однокурсника

Алексей Фурсин рассказал о проекте московского кинокластера

МегаФон: Каждый третий верит фишерам

Poisk-music.ru

Кажетта Ахметжанова: какие обереги помогают от сглаза

«Она все загубила»: Собчак назвала супругу Шнурова причиной крушения его карьеры

Нарядившуюся в мини пышную Нетребко раскритиковали за нелепость

Песня жителя Абдулино "Оренбургская красавица" украсила открытие международного форума

News.tennis

Дарья Касаткина поднялась на две позиции в мировом рейтинге

Касаткина проиграла Хаддад-Майе в финале турнира WTA 500 в Сеуле

Вероника Кудерметова победила Викторию Томову и пробилась в полуфинал WTA-500 в Сеуле

Теннисист Надаль вошел в состав сборной Испании на Кубок Дэвиса

Russian.city

На матче "ЦСКА-Динамо" родилась новая семья

ЗАМЕСТИТЕЛЬ ДИРЕКТОРА РОСГВАРДИИ ГЕНЕРАЛ-ПОЛКОВНИК ПОЛИЦИИ АНАТОЛИЙ МАЛИКОВ СОВЕРШИЛ РАБОЧУЮ ПОЕЗДКУ В КАЛУЖСКУЮ ОБЛАСТЬ

Представители Росгвардии приняли участие в совещании Комитета Совета Федерации по обороне и безопасности

Обзор известных приложений, созданных на iOS

Bigpot.news

Собянин: 14 паркам Москвы исполняется более 50 лет в 2024 году

Квартиры, бизнес, машины. Шнуров с женой могут делить имущество на миллиард

Можно ли стирать шторы: возможные риски

ТАСС: Лавров прибыл в Нью-Йорк на Генассамблею ООН

29ru.net

Туристический форум для школьников и педагогов пройдет в Мытищах 26 сентября

Более 16 тысяч человек сделали прививку от гриппа в Солнечногорске

Севморпуть может обеспечить 12-21 трлн рублей налоговых поступлений

Охотникам Подмосковья напомнили правила ношения оружия в преддверии сезона охоты

Путин в России и мире

Russia24.pro

На матче "ЦСКА-Динамо" родилась новая семья Обзор известных приложений, созданных на iOS Филиал № 4 ОСФР по Москве и Московской области информирует: Социальный фонд выплатит остатки материнского капитала менее 10 тысяч рублей Представители Росгвардии приняли участие в совещании Комитета Совета Федерации по обороне и безопасности

Life24.pro

В Подмосковье росгвардейцы задержали гражданина, находящегося в розыске. В городском округе Домодедово проведена агитационно-разъяснительная работа с населением о сохранности имущества. Преданья старины глубокой под тропический коктейль Парализуют глистов: врач Садыков подтвердил пользу тыквенных семян

Агрегатор новостей 24СМИ

123ru.net

Свыше 100 работ представят на выставке художницы Марины Домниковой в Мытищах В Москве появится уникальный спектакль в жанре фэнтези с популярными артистами Оплатить проезд по трассе М12 теперь можно на портале госуслуг КОМПАНИЯ LG ПРОВОДИТ ПЕРВУЮ ВСТРЕЧУ ГЛОБАЛЬНОГО КОНСОРЦИУМА ДЛЯ УСИЛЕНИЯ ТЕХНОЛОГИЧЕСКОГО СОТРУДНИЧЕСТВА

Персональные новости

Today24.pro

Inexperienced Secret service agent called tech support hotline for help piloting drone ahead of Trump rally shooting: bombshell report Elle King shares major life update after opening up about 'toxic' relationship with dad Rob Schneider Eddie Hearn threatens to ‘knock out’ rival promoter in bizarre confrontation on stage at Joshua vs Dubois face-offs Morning Briefing: Mets Keep Ground in Wild Card Race Despite Loss

Russian.city

Любовь Успенская

Успенская отсудила у обозвавшего её "попрошайкой" Киркорова 90 тысяч рублей

Агрегатор новостей 24СМИ

News Every Day

Eddie Hearn threatens to ‘knock out’ rival promoter in bizarre confrontation on stage at Joshua vs Dubois face-offs

Today24.pro

Inexperienced Secret service agent called tech support hotline for help piloting drone ahead of Trump rally shooting: bombshell report Eddie Hearn threatens to ‘knock out’ rival promoter in bizarre confrontation on stage at Joshua vs Dubois face-offs Elle King shares major life update after opening up about 'toxic' relationship with dad Rob Schneider Morning Briefing: Mets Keep Ground in Wild Card Race Despite Loss

123ru.net

Оплатить проезд по трассе М12 теперь можно на портале госуслуг Бастрыкин заинтересовался делом мигранта, оскорблявшего русских в Подмосковье Свыше 100 работ представят на выставке художницы Марины Домниковой в Мытищах Новые тенденции в запросах покупателей новостроек

Friends of Today24

Музыкальные новости

Агрегатор новостей 24СМИ

Персональные новости