We in Telegram
Add news
March 2010 April 2010 May 2010 June 2010 July 2010
August 2010
September 2010 October 2010
November 2010
December 2010
January 2011
February 2011 March 2011 April 2011 May 2011 June 2011 July 2011 August 2011 September 2011 October 2011 November 2011 December 2011 January 2012 February 2012 March 2012 April 2012 May 2012 June 2012 July 2012 August 2012 September 2012 October 2012 November 2012 December 2012 January 2013 February 2013 March 2013 April 2013 May 2013 June 2013 July 2013 August 2013 September 2013 October 2013 November 2013 December 2013 January 2014 February 2014 March 2014 April 2014 May 2014 June 2014 July 2014 August 2014 September 2014 October 2014 November 2014 December 2014 January 2015 February 2015 March 2015 April 2015 May 2015 June 2015 July 2015 August 2015 September 2015 October 2015 November 2015 December 2015 January 2016 February 2016 March 2016 April 2016 May 2016 June 2016 July 2016 August 2016 September 2016 October 2016 November 2016 December 2016 January 2017 February 2017 March 2017 April 2017 May 2017 June 2017 July 2017 August 2017 September 2017 October 2017 November 2017 December 2017 January 2018 February 2018 March 2018 April 2018 May 2018 June 2018 July 2018 August 2018 September 2018 October 2018 November 2018 December 2018 January 2019 February 2019 March 2019 April 2019 May 2019 June 2019 July 2019 August 2019 September 2019 October 2019 November 2019 December 2019 January 2020 February 2020 March 2020 April 2020 May 2020 June 2020 July 2020 August 2020 September 2020 October 2020 November 2020 December 2020 January 2021 February 2021 March 2021 April 2021 May 2021 June 2021 July 2021 August 2021 September 2021 October 2021 November 2021 December 2021 January 2022 February 2022 March 2022 April 2022 May 2022 June 2022 July 2022 August 2022 September 2022 October 2022 November 2022 December 2022 January 2023 February 2023 March 2023 April 2023 May 2023 June 2023 July 2023 August 2023 September 2023 October 2023 November 2023 December 2023 January 2024 February 2024 March 2024 April 2024 May 2024
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20
21
22
23
24
25
26
27
28
29
30
31
News Every Day |

AI systems are getting better at tricking us

A wave of AI systems have “deceived” humans in ways they haven’t been explicitly trained to do, by offering up untrue explanations for their behavior or concealing the truth from human users and misleading them to achieve a strategic end. 

This issue highlights how difficult artificial intelligence is to control and the unpredictable ways in which these systems work, according to a review paper published in the journal Patterns today that summarizes previous research.

Talk of deceiving humans might suggest that these models have intent. They don’t. But AI models will mindlessly find workarounds to obstacles to achieve the goals that have been given to them. Sometimes these workarounds will go against users’ expectations and feel deceitful.

One area where AI systems have learned to become deceptive is within the context of games that they’ve been trained to win—specifically if those games involve having to act strategically.

In November 2022, Meta announced it had created Cicero, an AI capable of beating humans at an online version of Diplomacy, a popular military strategy game in which players negotiate alliances to vie for control of Europe.

Meta’s researchers said they’d trained Cicero on a “truthful” subset of its data set to be largely honest and helpful, and that it would “never intentionally backstab” its allies in order to succeed. But the new paper’s authors claim the opposite was true: Cicero broke its deals, told outright falsehoods, and engaged in premeditated deception. Although the company did try to train Cicero to behave honestly, its failure to achieve that shows how AI systems can still unexpectedly learn to deceive, the authors say. 

Meta neither confirmed nor denied the researchers’ claims that Cicero displayed deceitful behavior, but a spokesperson said that it was purely a research project and the model was built solely to play Diplomacy. “We released artifacts from this project under a noncommercial license in line with our long-standing commitment to open science,” they say. “Meta regularly shares the results of our research to validate them and enable others to build responsibly off of our advances. We have no plans to use this research or its learnings in our products.” 

But it’s not the only game where an AI has “deceived” human players to win. 

AlphaStar, an AI developed by DeepMind to play the video game StarCraft II, became so adept at making moves aimed at deceiving opponents (known as feinting) that it defeated 99.8% of human players. Elsewhere, another Meta system called Pluribus learned to bluff during poker games so successfully that the researchers decided against releasing its code for fear it could wreck the online poker community. 

Beyond games, the researchers list other examples of deceptive AI behavior. GPT-4, OpenAI’s latest large language model, came up with lies during a test in which it was prompted to persuade a human to solve a CAPTCHA for it. The system also dabbled in insider trading during a simulated exercise in which it was told to assume the identity of a pressurized stock trader, despite never being specifically instructed to do so.

The fact that an AI model has the potential to behave in a deceptive manner without any direction to do so may seem concerning. But it mostly arises from the “black box” problem that characterizes state-of-the-art machine-learning models: it is impossible to say exactly how or why they produce the results they do—or whether they’ll always exhibit that behavior going forward, says Peter S. Park, a postdoctoral fellow studying AI existential safety at MIT, who worked on the project. 

“Just because your AI has certain behaviors or tendencies in a test environment does not mean that the same lessons will hold if it’s released into the wild,” he says. “There’s no easy way to solve this—if you want to learn what the AI will do once it’s deployed into the wild, then you just have to deploy it into the wild.”

Our tendency to anthropomorphize AI models colors the way we test these systems and what we think about their capabilities. After all, passing tests designed to measure human creativity doesn’t mean AI models are actually being creative. It is crucial that regulators and AI companies carefully weigh the technology’s potential to cause harm against its potential benefits for society and make clear distinctions between what the models can and can’t do, says Harry Law, an AI researcher at the University of Cambridge, who did not work on the research.“These are really tough questions,” he says.

Fundamentally, it’s currently impossible to train an AI model that’s incapable of deception in all possible situations, he says. Also, the potential for deceitful behavior is one of many problems—alongside the propensity to amplify bias and misinformation—that need to be addressed before AI models should be trusted with real-world tasks. 

“This is a good piece of research for showing that deception is possible,” Law says. “The next step would be to try and go a little bit further to figure out what the risk profile is, and how likely the harms that could potentially arise from deceptive behavior are to occur, and in what way.”

AML check crypto

Ballroom culture coming to the Long Beach Pride Festival

Gunmen open fire and kill 4 people, including 3 foreigners, in Afghanistan's central Bamyan province

Glen Powell’s parents crash Texas movie screening to troll him

Ria.city






Read also

Worker killed in forklift accident in Cheektowaga

LA Fleet Week: If you like tall ships, then you won’t want to miss the Festival of Sail

Look up

News, articles, comments, with a minute-by-minute update, now on Today24.pro

News Every Day

Ballroom culture coming to the Long Beach Pride Festival

Today24.pro — latest news 24/7. You can add your news instantly now — here


News Every Day

AML check crypto



Sports today


Новости тенниса
Камила Джорджи

Экс‑теннисистка Джорджи обвиняется в краже мебели и ковров на €100 тысяч — СМИ



Спорт в России и мире
Москва

Росгвардейцы обеспечили правопорядок во время футбольных матчей в Москве



All sports news today





Sports in Russia today

Москва

Футболисты ФК «Динамо Пушкино» заняли второе место в межрегиональном турнире Laffy Cup 2024


Новости России

Game News

Five new Steam games you probably missed (May 20, 2024)


Russian.city


Москва

«СВЯТОЙ ЛЕНИН» помогает В.В. Путину улучшить либо отменить налоги в обществе.


Губернаторы России
Константин Малофеев

Малофеев об акциях молодёжи в десятках русских городов: "Будущее у нашей Империи есть"


Шапки женские вязаные на Wildberries, 2024 — новый цвет от 392 руб. (модель 466)

Sohu: Пекин удивлен первым действием Путина по возвращению из Китая

Академик Нетесов назвал животных, являющихся переносчиками крысиного гепатита

Что там в IT: ИИ-отрыв Google, ChatGPT почти человек, отечественный BIOS


В ЦТК концерт, посвященный юбилею Алсу Альметовой

Мама Тимати ответила на провокационные вопросы о Валентине Ивановой и других девушках своего сына

Мама Тимати объяснила, почему внуки Ратмир и Алиса могут не сблизиться

Карди Би все-таки выпустит альбом в 2024 году


Российский теннисист Медведев опустится на строчку в рейтинге ATP

Рыбакина узнала свое место в новом мировом рейтинге

Даниил Медведев идет третьим в чемпионской гонке ATP, Андрей Рублев — пятый

Рахимова прошла во второй круг турнира WTA в Рабате на отказе Таунсенд



Эксперт Президентской академии в Санкт-Петербурге о допфинансировании на модернизацию и строительство школ в регионах

Мистический Тибет: путеводитель по местам силы от Кажетты Ахметжановой

«СВЯТОЙ ЛЕНИН» помогает В.В. Путину улучшить либо отменить налоги в обществе.

Энергетики «Россети Центр» и «Россети Центр и Приволжье» стали участниками полумарафона «Забег. РФ»


Бурятский театр «Ульгэр» показал на выставке «Театральная весна» кукольный постановку: Россия, Дети, нацпроект Культура

Против незаконного визита Лукашенко в оккупированный и деарменизированный Карабах высказался не МИД Армении, а Тихановская. Фоторяд

Создание Сайтов. Создание веб сайта. Создание сайта html. Создание сайтов цена. Создание и продвижение сайтов. Создание сайта с нуля. Создание интернет сайта.

«СВЯТОЙ ЛЕНИН» помогает В.В. Путину улучшить либо отменить налоги в обществе.


Следующий сезон ВХЛ станет на 1 месяц длиннее, в нем сыграют 33 команды. ХК «Южный Урал» заявлен

Политолог Асафов: праймериз ЕР - реализация запроса на обновление партии

Г.А. Зюганов встретился в Москве с китайским академиком-марксистом Чен Эньфу

В Кашире переданы участки для строительства новых тепловых пунктов и котельных



Путин в России и мире






Персональные новости Russian.city
Булат Окуджава

В Душанбе открылась книжная выставка, посвящённая Юлии Друниной и Булату Окуджаве



News Every Day

Glen Powell’s parents crash Texas movie screening to troll him




Friends of Today24

Музыкальные новости

Персональные новости