March 2010 April 2010 May 2010 June 2010 July 2010
August 2010
September 2010 October 2010
November 2010
December 2010 January 2011 February 2011 March 2011 April 2011 May 2011 June 2011 July 2011 August 2011 September 2011 October 2011 November 2011 December 2011 January 2012 February 2012 March 2012 April 2012 May 2012 June 2012 July 2012 August 2012 September 2012 October 2012 November 2012 December 2012 January 2013 February 2013 March 2013 April 2013 May 2013 June 2013 July 2013 August 2013 September 2013 October 2013 November 2013 December 2013 January 2014 February 2014 March 2014 April 2014 May 2014 June 2014 July 2014 August 2014 September 2014 October 2014 November 2014 December 2014 January 2015 February 2015 March 2015 April 2015 May 2015 June 2015 July 2015 August 2015 September 2015 October 2015 November 2015 December 2015 January 2016 February 2016 March 2016 April 2016 May 2016 June 2016 July 2016 August 2016 September 2016 October 2016 November 2016 December 2016 January 2017 February 2017 March 2017 April 2017 May 2017 June 2017 July 2017 August 2017 September 2017 October 2017 November 2017 December 2017 January 2018 February 2018 March 2018 April 2018 May 2018 June 2018 July 2018 August 2018 September 2018 October 2018 November 2018 December 2018 January 2019 February 2019 March 2019 April 2019 May 2019 June 2019 July 2019 August 2019 September 2019 October 2019 November 2019 December 2019 January 2020 February 2020 March 2020 April 2020 May 2020 June 2020 July 2020 August 2020 September 2020 October 2020 November 2020 December 2020 January 2021 February 2021 March 2021 April 2021 May 2021 June 2021 July 2021 August 2021 September 2021 October 2021 November 2021 December 2021 January 2022 February 2022 March 2022 April 2022 May 2022 June 2022 July 2022 August 2022 September 2022 October 2022 November 2022 December 2022 January 2023 February 2023 March 2023 April 2023 May 2023 June 2023 July 2023 August 2023 September 2023 October 2023 November 2023 December 2023 January 2024 February 2024 March 2024 April 2024 May 2024 June 2024 July 2024 August 2024 September 2024
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25
26
27
28
29
30
News Every Day |

OpenAI’s Strawberry program is reportedly capable of reasoning. It might be able to deceive humans

OpenAI, the company that made ChatGPT, has launched a new artificial intelligence (AI) system called Strawberry. It is designed not just to provide quick responses to questions, like ChatGPT, but to think or “reason”.

This raises several major concerns. If Strawberry really is capable of some form of reasoning, could this AI system cheat and deceive humans?

OpenAI can program the AI in ways that mitigate its ability to manipulate humans. But the company’s own evaluations rate it as a “medium risk” for its ability to assist experts in the “operational planning of reproducing a known biological threat” – in other words, a biological weapon. It was also rated as a medium risk for its ability to persuade humans to change their thinking.

It remains to be seen how such a system might be used by those with bad intentions, such as con artists or hackers. Nevertheless, OpenAI’s evaluation states that medium-risk systems can be released for wider use – a position I believe is misguided.

Strawberry is not one AI “model”, or program, but several – known collectively as o1. These models are intended to answer complex questions and solve intricate maths problems. They are also capable of writing computer code – to help you make your own website or app, for example.

An apparent ability to reason might come as a surprise to some, since this is generally considered a precursor to judgment and decision making – something that has often seemed a distant goal for AI. So, on the surface at least, it would seem to move artificial intelligence a step closer to human-like intelligence.

When things look too good to be true, there’s often a catch. Well, this set of new AI models is designed to maximise their goals. What does this mean in practice? To achieve its desired objective, the path or the strategy chosen by AI may not always necessarily be fair, or align with human values.

True intentions

For example, if you were to play chess against Strawberry, in theory, could its reasoning allow it to hack the scoring system rather than figure out the best strategies for winning the game?

The AI might also be able to lie to humans about its true intentions and capabilities, which would pose a serious safety concern if it were to be deployed widely. For example, if the AI knew it was infected with malware, could it “choose” to conceal this fact in the knowledge that a human operator might opt to disable the whole system if they knew?

Strawberry goes a step beyond the capabilities of AI chatbots. Robert Way / Shutterstock

These would be classic examples of unethical AI behaviour, where cheating or deceiving is acceptable if it leads to a desired goal. It would also be quicker for the AI, as it wouldn’t have to waste any time figuring out the next best move. It may not necessarily be morally correct, however.

This leads to a rather interesting yet worrying discussion. What level of reasoning is Strawberry capable of and what could its unintended consequences be? A powerful AI system that’s capable of cheating humans could pose serious ethical, legal and financial risks to us.

Such risks become grave in critical situations, such as designing weapons of mass destruction. OpenAI rates its own Strawberry models as “medium risk” for their potential to assist scientists in developing chemical, biological, radiological and nuclear weapons.

OpenAI says: “Our evaluations found that o1-preview and o1-mini can help experts with the operational planning of reproducing a known biological threat.” But it goes on to say that experts already have significant expertise in these areas, so the risk would be limited in practice. It adds: “The models do not enable non-experts to create biological threats, because creating such a threat requires hands-on laboratory skills that the models cannot replace.”

Powers of persuasion

OpenAI’s evaluation of Strawberry also investigated the risk that it could persuade humans to change their beliefs. The new o1 models were found to be more persuasive and more manipulative than ChatGPT.

OpenAI also tested a mitigation system that was able to reduce the manipulative capabilities of the AI system. Overall, Strawberry was labelled a medium risk for “persuasion” in Open AI’s tests.

Strawberry was rated low risk for its ability to operate autonomously and on cybersecurity.

Open AI’s policy states that “medium risk” models can be released for wide use. In my view, this underestimates the threat. The deployment of such models could be catastrophic, especially if bad actors manipulate the technology for their own pursuits.

This calls for strong checks and balances that will only be possible through AI regulation and legal frameworks, such as penalising incorrect risk assessments and the misuse of AI.

The UK government stressed the need for “safety, security and robustness” in their 2023 AI white paper, but that’s not nearly enough. There is an urgent need to prioritise human safety and devise rigid scrutiny protocols for AI models such as Strawberry.

Shweta Singh does not work for, consult, own shares in or receive funding from any company or organisation that would benefit from this article, and has disclosed no relevant affiliations beyond their academic appointment.

Москва

Спецконтейнеры для сбора сжатых пластиковых бутылок выпустили в Подмосковье

Inexperienced Secret service agent called tech support hotline for help piloting drone ahead of Trump rally shooting: bombshell report

Elle King shares major life update after opening up about 'toxic' relationship with dad Rob Schneider

Eddie Hearn threatens to ‘knock out’ rival promoter in bizarre confrontation on stage at Joshua vs Dubois face-offs

Mum leaves people raging over VERY unique baby moniker, as they remind her she’s ‘naming kids, not Hungry Hippos’

Ria.city






Read also

Iconic fizzy drink brand to be ‘retired’ leaving fans fearing it will be discontinued

‘What a twist!’ fume Great British Bake Off fans as they slam major format change

Pope Francis, back from flu, calls airstrikes on Lebanon ‘unacceptable’

News, articles, comments, with a minute-by-minute update, now on Today24.pro

News Every Day

Eddie Hearn threatens to ‘knock out’ rival promoter in bizarre confrontation on stage at Joshua vs Dubois face-offs

Today24.pro — latest news 24/7. You can add your news instantly now — here


News Every Day

Eddie Hearn threatens to ‘knock out’ rival promoter in bizarre confrontation on stage at Joshua vs Dubois face-offs



Sports today


Новости тенниса
Кубок Лейвера

Медведев в составе сборной Европы завоевал Кубок Лейвера



Спорт в России и мире
Москва

На матче "ЦСКА-Динамо" родилась новая семья



All sports news today





Sports in Russia today

Москва

Кадеты Пермского президентского кадетского училища стали участниками первого этапа Всероссийской олимпиады школьников


Новости России

Game News

Мультиплеерный данжен-кроулер Greedy Wizards: Speed Dungeon вышел в новой стране на iOS и Android


Russian.city


Москва

На матче "ЦСКА-Динамо" родилась новая семья


Губернаторы России
БГАТОиБ

В Бурятском театре оперы и балета появилась бонусная программа


Агния Кузнецова в шоу «Вкусно с Анфисой Чеховой» рассказала, как убедила Балабанова взять на роль её однокурсника

Агния Кузнецова в шоу «Вкусно с Анфисой Чеховой» рассказала, как убедила Балабанова взять на роль её однокурсника

Алексей Фурсин рассказал о проекте московского кинокластера

МегаФон: Каждый третий верит фишерам


Кажетта Ахметжанова: какие обереги помогают от сглаза

«Она все загубила»: Собчак назвала супругу Шнурова причиной крушения его карьеры

Нарядившуюся в мини пышную Нетребко раскритиковали за нелепость

Песня жителя Абдулино "Оренбургская красавица" украсила открытие международного форума


Дарья Касаткина поднялась на две позиции в мировом рейтинге

Касаткина проиграла Хаддад-Майе в финале турнира WTA 500 в Сеуле

Вероника Кудерметова победила Викторию Томову и пробилась в полуфинал WTA-500 в Сеуле

Теннисист Надаль вошел в состав сборной Испании на Кубок Дэвиса



На матче "ЦСКА-Динамо" родилась новая семья

ЗАМЕСТИТЕЛЬ ДИРЕКТОРА РОСГВАРДИИ ГЕНЕРАЛ-ПОЛКОВНИК ПОЛИЦИИ АНАТОЛИЙ МАЛИКОВ СОВЕРШИЛ РАБОЧУЮ ПОЕЗДКУ В КАЛУЖСКУЮ ОБЛАСТЬ

Представители Росгвардии приняли участие в совещании Комитета Совета Федерации по обороне и безопасности

Обзор известных приложений, созданных на iOS


Собянин: 14 паркам Москвы исполняется более 50 лет в 2024 году

Квартиры, бизнес, машины. Шнуров с женой могут делить имущество на миллиард

Можно ли стирать шторы: возможные риски

ТАСС: Лавров прибыл в Нью-Йорк на Генассамблею ООН


Туристический форум для школьников и педагогов пройдет в Мытищах 26 сентября

Более 16 тысяч человек сделали прививку от гриппа в Солнечногорске

Севморпуть может обеспечить 12-21 трлн рублей налоговых поступлений

Охотникам Подмосковья напомнили правила ношения оружия в преддверии сезона охоты



Путин в России и мире






Персональные новости Russian.city
Любовь Успенская

Успенская отсудила у обозвавшего её "попрошайкой" Киркорова 90 тысяч рублей



News Every Day

Eddie Hearn threatens to ‘knock out’ rival promoter in bizarre confrontation on stage at Joshua vs Dubois face-offs




Friends of Today24

Музыкальные новости

Персональные новости