News Every Day

Reckoning with generative AI’s uncanny valley

Generative AI has the power to surprise in a way that few other technologies can. Sometimes that’s a very good thing; other times, not so good. In theory, as generative AI improves, this issue should become less important. However, in reality, as generative AI becomes more “human” it can begin to turn sinister and unsettling, plunging us into what robotics has long described as the “uncanny valley.”

It might be tempting to overlook this experience as something that can be corrected by bigger data sets or better training. However, insofar as it speaks to a disturbance in our mental model of the technology (e.g., “I don’t like what it did there”), it’s something that needs to be acknowledged and addressed.

Mental models and antipatterns

Mental models are an important concept in UX and product design, but they need to be more readily embraced by the AI community. At one level, mental models often go unnoticed because they are routine patterns of our assumptions about an AI system. This is something we discussed at length while putting together the latest volume of the Thoughtworks Technology Radar, a biannual report based on our experiences working with clients all over the world.

For instance, we called out complacency with AI-generated code and replacing pair programming with generative AI as two practices we believe practitioners must avoid as the popularity of AI coding assistants continues to grow. Both emerge from poor mental models that fail to acknowledge how this technology actually works and what its limitations are. The consequence is that the more convincing and “human” these tools become, the harder it is for us to acknowledge how the technology actually works and the limitations of the “solutions” it provides us.

Of course, for those deploying generative AI into the world, the risks are similar, perhaps even more pronounced. While the intent behind such tools is usually to create something convincing and usable, if such tools mislead, trick, or even merely unsettle users, their value and worth evaporate. It’s no surprise that legislation such as the EU AI Act, which requires deepfake creators to label content as “AI generated,” is being passed to address these problems.

It’s worth pointing out that this isn’t just an issue for AI and robotics. Back in 2011, our colleague Martin Fowler wrote about how certain approaches to building cross-platform mobile applications can create an uncanny valley, “where things work mostly like… native controls but there are just enough tiny differences to throw users off.”

Specifically, Fowler wrote something we think is instructive: “different platforms have different ways they expect you to use them that alter the entire experience design.” The point here, applied to generative AI, is that different contexts and different use cases all come with different sets of assumptions and mental models that change at what point users might drop into the uncanny valley. These subtle differences change one’s experience or perception of a large language model’s (LLM) output.

For example, for the drug researcher who wants vast amounts of synthetic data, accuracy at a micro level may be unimportant; for the lawyer trying to grasp legal documentation, accuracy matters a lot. In fact, dropping into the uncanny valley might just be the signal to step back and reassess your expectations.

Shifting our perspective

The uncanny valley of generative AI might be troubling, even something we want to minimize, but it should also remind us of generative AI’s limitations—it should encourage us to rethink our perspective.

There have been some interesting attempts to do that across the industry. One that stands out comes from Ethan Mollick, a professor at the University of Pennsylvania, who argues that AI shouldn’t be understood as good software but instead as “pretty good people.”

Therefore, our expectations about what generative AI can do and where it’s effective must remain provisional and flexible. To a certain extent, this might be one way of overcoming the uncanny valley: by reflecting on our assumptions and expectations, we remove the technology’s power to disturb or confound them.

However, simply calling for a mindset shift isn’t enough. There are various practices and tools that can help. One example, which we identified in the latest Technology Radar, is the technique of getting structured outputs from LLMs. This can be done either by instructing a model to respond in a particular format when prompting or through fine-tuning. Tools like Instructor are making this easier, and the technique creates greater alignment between expectations and what the LLM will output. While there’s still a chance something unexpected or not quite right might happen, this technique goes some way toward addressing that.
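To make the structured-output idea concrete, here is a minimal sketch using only the Python standard library. It assumes a model has been prompted to reply with a JSON object containing `summary`, `sentiment`, and `confidence` fields. Those field names, the `Review` class, and the `parse_review` function are hypothetical and chosen for illustration; they are not taken from Instructor or the Technology Radar, and no real LLM is called.

```python
import json
from dataclasses import dataclass

# Hypothetical set of labels the prompt asked the model to choose from.
ALLOWED_SENTIMENTS = ("positive", "neutral", "negative")


@dataclass(frozen=True)
class Review:
    """The shape we asked the model to produce (illustrative schema)."""
    summary: str
    sentiment: str
    confidence: float


def parse_review(raw_reply: str) -> Review:
    """Parse a model reply and raise ValueError if it is off-schema."""
    try:
        data = json.loads(raw_reply)
    except json.JSONDecodeError as exc:
        raise ValueError(f"reply is not valid JSON: {exc}") from exc
    if not isinstance(data, dict):
        raise ValueError("reply must be a JSON object")

    summary = data.get("summary")
    sentiment = data.get("sentiment")
    confidence = data.get("confidence")

    if not isinstance(summary, str) or not summary.strip():
        raise ValueError("summary must be a non-empty string")
    if sentiment not in ALLOWED_SENTIMENTS:
        raise ValueError(f"sentiment must be one of {ALLOWED_SENTIMENTS}")
    # bool is a subclass of int in Python, so rule it out explicitly.
    if isinstance(confidence, bool) or not isinstance(confidence, (int, float)):
        raise ValueError("confidence must be a number")
    if not 0.0 <= confidence <= 1.0:
        raise ValueError("confidence must be between 0 and 1")

    return Review(summary=summary, sentiment=sentiment, confidence=float(confidence))
```

Rejecting an off-schema reply at this boundary means a surprising output surfaces as an explicit error rather than as quietly odd behavior further downstream, which is one way to keep expectations and outputs aligned.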

There are other techniques too, including retrieval-augmented generation as a way of better controlling the “context window.” There are also frameworks and tools that can help evaluate and measure the success of such techniques, including Ragas and DeepEval, libraries that provide AI developers with metrics for faithfulness and relevance.
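As a rough illustration of both ideas, the sketch below picks the most relevant passages for a question, places only those in the prompt, and then computes a crude grounding score for an answer. Everything here is an assumption made for illustration: real systems typically retrieve with embeddings rather than word overlap, and `grounded_fraction` is a simple stand-in, not the faithfulness metric that Ragas or DeepEval implements.

```python
import re


def tokenize(text: str) -> set[str]:
    """Lowercase the text and return its set of alphanumeric words."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def retrieve(question: str, documents: list[str], k: int = 2) -> list[str]:
    """Return up to k documents, ranked by word overlap with the question."""
    question_words = tokenize(question)
    scored = [(len(question_words & tokenize(doc)), doc) for doc in documents]
    scored = [pair for pair in scored if pair[0] > 0]  # drop unrelated documents
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [doc for _, doc in scored[:k]]


def build_prompt(question: str, documents: list[str], k: int = 2) -> str:
    """Assemble a prompt whose context holds only the retrieved passages."""
    context = "\n".join(f"- {doc}" for doc in retrieve(question, documents, k))
    return (
        "Answer using only the context below.\n"
        f"Context:\n{context}\n"
        f"Question: {question}"
    )


def grounded_fraction(answer: str, context: list[str]) -> float:
    """Share of the answer's words that also appear in the context (crude proxy)."""
    answer_words = tokenize(answer)
    if not answer_words:
        return 0.0
    context_words = set().union(*(tokenize(doc) for doc in context))
    return len(answer_words & context_words) / len(answer_words)
```

A low grounding score does not prove an answer is wrong, but it can flag outputs that drift away from the supplied context and deserve a closer look.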

Measurement is important, as are relevant guidelines and policies for LLMs, such as LLM guardrails. It’s important to take steps to better understand what’s actually happening inside these models. Completely unpacking these black boxes might be impossible, but tools like Langfuse can help. Doing so may go a long way in reorienting the relationship with this technology, shifting mental models, and removing the possibility of falling into the uncanny valley.
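To give a sense of what a guardrail paired with basic tracing might look like, here is a minimal sketch. The rules, the `check_output` and `traced_check` names, and the in-memory trace list are all hypothetical; this does not use the Langfuse API or any particular guardrails library, and production systems would use far richer checks.

```python
import re
import time
from dataclasses import dataclass, field

# Hypothetical rules: patterns we do not want to appear in model output.
BLOCKED_PATTERNS = {
    "email address": re.compile(r"[\w.+-]+@[\w-]+\.[\w.-]+"),
    "unhedged guarantee": re.compile(r"\bguaranteed?\b", re.IGNORECASE),
}
MAX_CHARS = 500  # illustrative length limit


@dataclass
class GuardrailResult:
    allowed: bool
    reasons: list[str] = field(default_factory=list)


def check_output(text: str) -> GuardrailResult:
    """Apply every rule and collect the names of the ones that fail."""
    reasons = [name for name, pattern in BLOCKED_PATTERNS.items() if pattern.search(text)]
    if len(text) > MAX_CHARS:
        reasons.append("too long")
    return GuardrailResult(allowed=not reasons, reasons=reasons)


def traced_check(prompt: str, output: str, trace: list[dict]) -> GuardrailResult:
    """Run the guardrail and append a record of the exchange to the trace."""
    result = check_output(output)
    trace.append(
        {
            "timestamp": time.time(),
            "prompt": prompt,
            "output": output,
            "allowed": result.allowed,
            "reasons": result.reasons,
        }
    )
    return result
```

Keeping a record of each prompt, output, and guardrail decision does not open the black box, but it gives teams something concrete to review when an output feels off.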

An opportunity, not a flaw

These tools, part of a Cambrian explosion of generative AI tools, can help practitioners rethink generative AI and, hopefully, build better and more responsible products. However, for the wider world, this work will remain invisible. What’s important is exploring how we can evolve toolchains to better control and understand generative AI, because existing mental models and conceptions of generative AI are a fundamental design problem, not a marginal issue we can choose to ignore.

Ken Mugrage is the principal technologist in the office of the CTO at Thoughtworks. Srinivasan Raguraman is a technical principal at Thoughtworks based in Singapore.

This content was produced by Thoughtworks. It was not written by MIT Technology Review’s editorial staff.
