March 2010 April 2010 May 2010 June 2010 July 2010
August 2010
September 2010 October 2010
November 2010
December 2010 January 2011 February 2011 March 2011 April 2011 May 2011 June 2011 July 2011 August 2011 September 2011 October 2011 November 2011 December 2011 January 2012 February 2012 March 2012 April 2012 May 2012 June 2012 July 2012 August 2012 September 2012 October 2012 November 2012 December 2012 January 2013 February 2013 March 2013 April 2013 May 2013 June 2013 July 2013 August 2013 September 2013 October 2013 November 2013 December 2013 January 2014 February 2014 March 2014 April 2014 May 2014 June 2014 July 2014 August 2014 September 2014 October 2014 November 2014 December 2014 January 2015 February 2015 March 2015 April 2015 May 2015 June 2015 July 2015 August 2015 September 2015 October 2015 November 2015 December 2015 January 2016 February 2016 March 2016 April 2016 May 2016 June 2016 July 2016 August 2016 September 2016 October 2016 November 2016 December 2016 January 2017 February 2017 March 2017 April 2017 May 2017 June 2017 July 2017 August 2017 September 2017 October 2017 November 2017 December 2017 January 2018 February 2018 March 2018 April 2018 May 2018 June 2018 July 2018 August 2018 September 2018 October 2018 November 2018 December 2018 January 2019 February 2019 March 2019 April 2019 May 2019 June 2019 July 2019 August 2019 September 2019 October 2019 November 2019 December 2019 January 2020 February 2020 March 2020 April 2020 May 2020 June 2020 July 2020 August 2020 September 2020 October 2020 November 2020 December 2020 January 2021 February 2021 March 2021 April 2021 May 2021 June 2021 July 2021 August 2021 September 2021 October 2021 November 2021 December 2021 January 2022 February 2022 March 2022 April 2022 May 2022 June 2022 July 2022 August 2022 September 2022 October 2022 November 2022 December 2022 January 2023 February 2023 March 2023 April 2023 May 2023 June 2023 July 2023 August 2023 September 2023 October 2023 November 2023 December 2023 January 2024 February 2024 March 2024 April 2024 May 2024 June 2024 July 2024 August 2024 September 2024 October 2024
1 2 3 4 5 6 7 8 9 10 11 12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
News Every Day |

Why The New York Times' lawyers are inspecting OpenAI's code in a secretive room

OpenAI and Microsoft are defending lawsuits from publishers, authors, and other creators alleging they ripped off their work. They can reshape the generative AI industry.
  • Lawyers for The New York Times are poring through ChatGPT's source code and training material.
  • Copyright cases from publishers and authors are trying to figure out how AI trains on creative work.
  • The lawsuits could chart a path forward, much as Napster's legal morass did two decades ago.

Somewhere in the United States, in a secure room, on a computer unconnected to the internet, sits the source code for ChatGPT.

It is there to be inspected by lawyers for The New York Times.

By the order of a federal judge, the lawyers can only get into the room if they show a government-issued ID to a security guard. They are forbidden from bringing in their own phones, flash drives, or any other electronic devices. They're given a computer — also unconnected from the internet — with a word processing program. After each session, their notes can be downloaded to a different computer, and then the original note-taking computer may be wiped.

The Times' lawyers can share their notes with up to five outside consultants to help them understand what the code does. If one of the lawyers wants to show OpenAI CEO Sam Altman a snippet of the code to ask him questions about it for a deposition, that copy will be destroyed afterward.

OpenAI is worth $157 billion largely because of the success of ChatGPT. But to build the chatbot, the company trained its models on vast quantities of text it didn't pay a penny for.

That text includes stories from The New York Times, articles from other publications, and an untold number of copyrighted books.

The examination of the code for ChatGPT, as well as for Microsoft's artificial intelligence models built using OpenAI's technology, is crucial for the copyright infringement lawsuits against the two companies.

Publishers and artists have filed about two dozen major copyright lawsuits against generative AI companies. They are out for blood, demanding a slice of the economic pie that made OpenAI the dominant player in the industry and which pushed Microsoft's valuation beyond $3 trillion. Judges deciding those cases may carve out the legal parameters for how large language models are trained in the US.

"Developers should pay for the valuable publisher content that is used to create and operate their products," a Times spokesperson told BI. "The future success of this technology need not come at the expense of journalistic institutions."

For the lawsuit, the 173-year-old media company employed an elite law firm, Susman Godfrey, which recently won Dominion's mammoth $787.5 million settlement from Fox News. Other lawsuits from newsrooms, including The New York Daily News and Mother Jones, have latched on to the case.

Susman Godfrey is also representing a group of authors including George RR Martin, Jodi Picoult, and Jia Tolentino, who filed copyright claims months before the Times. If a judge certifies their class-action status, an eventual settlement or judgment could have ramifications for virtually every author and artist whose work has been used to train AI models.

On September 12, dozens of lawyers from the tech and journalism companies packed into a magistrate judge's courtroom in lower Manhattan to figure out the best way to divvy up the discovery process, including inspecting ChatGPT's code and training data. Along with the authors' lawyers, they are still deciding who they can depose and how to schedule the depositions.

"It's as thrilling as things get for law professors who work in copyright," said Kristelia García, an intellectual property law professor at Georgetown University Law.

Setting the rules

With Congress taking a backseat on AI regulation, the industry expects courts to set — or, they hope, not set — the rules.

Many publishers, including Business Insider owner Axel Springer, have struck deals with generative AI companies to share their content for LLM training.

The scope and resources of the Times' lawsuit make it a likely candidate for a precedent-setting Supreme Court. Lawyers are also looking at class action lawsuits from authors, as well a music industry case against Anthropic, as ones to watch.

"The New York Times is a journalistic juggernaut," García said. "It's big, it has a lot of content. More importantly, perhaps, it has a lot of market power behind that content."

The lawsuit argues OpenAI infringed on its intellectual property in two ways.

There is the "input" case — alleging that the LLM illegally hoovered up over 10 million New York Times articles to train ChatGPT and Microsoft Copilot without compensation. And the "output" case — arguing that when asked, ChatGPT can spit out a New York Times article that readers would otherwise pay a subscription for.

In court filings, lawyers have repeatedly cited Napster, which illegally copied millions of songs and made them available for free. OpenAI similarly used high-quality, well-researched, well-written, and fact-based New York Times articles to make ChatGPT so impressive, the Times argues.

If anything, OpenAI is worse, according to Justin Nelson, a Susman Godfrey attorney representing authors in a class action lawsuit running parallel to Times' case and in a similar case against Anthropic.

Napster was a project from college kids; OpenAI is backed by Microsoft and already worth billions.

"Instead of kids, it was a sophisticated company," Nelson told BI. "And instead of doing it for their own personal use, they were doing it for commercial gain.

Representatives for OpenAI and Microsoft didn't respond to requests for comment from Business Insider. In court, they argue the legal doctrine of "fair use" protects how their models ingest the articles. The ChatGPT outputs with near-verbatim copies of Times articles were "highly anomalous" results that aren't representative of how the app is used, they say.

Napster was sued out of existence, but it inspired the music industry to adopt MP3s and, eventually, streaming — now used for everything from video games to movies. Spotify cofounder Daniel Ek has cited Napster as an inspiration, and Napster cofounder Sean Parker has praised Spotify as a successor.

Copyright lawsuits from journalism organizations may set the pace for all AI generators, predicted García, who worked in the music industry for a decade. AI isn't particularly good at generating movies or doing reporting, but it can convincingly mimic journalism.

"Journalism is kind of the canary in the coal mine," García said. "In the same way that music was the canary back in the Napster days, because people could easily torrent an MP3. But you couldn't, at that time, easily torrent a film."

Given the sheer number of people involved, the authors' lawsuits could have an even more dramatic effect. A settlement or judgment could change business models.

"People get creative in class action settlements," said Matthew Sag, an Emory University law professor studying copyright law and artificial intelligence. "You could cut the authors of America in for a percentage of stock or something."

The source code

The nature of generative AI technology itself lies at the heart of the copyright disputes.

What actually happens when a large language model "learns" a book or news article? What about when ChatGPT digs through the model to answer a query? Does the process make a "copy" in any meaningful sense of the word? Or is the training data just part of a big slurry of ones and zeros that no longer meaningfully resemble specific works?

The lawyers and consultants poring through ChatGPT's code are trying to answer those questions. They are also examining the LLM training data and plan to ask key OpenAI executives and programmers — under oath — how the models are meant to work.

Once the code is read and depositions taken, the parties will be in a better position to argue about "fair use," a notoriously tricky legal doctrine that protects the use of "transformative" creations derived from copyrighted material.

If OpenAI really is making copies of books and news articles, Napster-style, then is its training process sufficiently transformative to be considered "fair use"? Judges across the country are "all over the map" in deciding fair use copyright cases, according to Christa Laser, an intellectual property law professor at Cleveland State University, setting up high and unpredictable stakes.

"I think that's going to be the big question at the end of the day that's going to go all the way up to the Supreme Court," Laser told BI. "That question of fair use around training data, ingesting and training."

A key "fair use" question is whether ChatGPT's creations compete with the original journalistic works — an urgent issue for news organizations.

"The news publishers are the first to bring these big suits because they have more on the line," García said.

To make a copyright claim, a plaintiff can't just point to a corpus of work used as inspiration. It needs to point to a specific work they say has been copied.

In its lawsuit, The New York Times attached tens of thousands of pages of exhibits tabulating 10,553,897 articles. It says OpenAI and Microsoft illegally violated the copyrights for each of them.

Among those articles is a 2001 story, shortly after an appellate court ruled against Napster, where a journalist asked users about what they'd do. They all agreed there was no going back.

''If Napster does shut down, there are more sites out there,'' one user told the reporter. ''And they may get a few, but they can't stop all of them.''

Correction: This story previously included Ta-Nehisi Coates among a list of named plaintiffs in a New York-based class action lawsuit brought against OpenAI and Microsoft. He is a named plaintiff in a different class action lawsuit in California.

Read the original article on Business Insider
Интернет

ТСД SAOTRON RT41 GUN: практичный, производительный, надёжный

Cyprus Business Now: high rents, financial support to wine industry, PwC’s Academy Business Professionals Certificate

NIN-SIM linkage of all phone numbers completed, says NCC

Single family residence sells for $2.4 million in San Jose

America’s Greatest Tradition

Ria.city






Read also

Boeing lawyers argue for settlement opposed by relatives of those killed in 737 Max crashes

Nicola Peltz Beckham Goes Sky-High in Black Platform Heels at Cloud23 Launch Party in Los Angeles With Brooklyn Peltz Beckham

Brett Goldstein & Imogen Poots Attend 'All of You' Premiere at BFI London Film Festival 2024

News, articles, comments, with a minute-by-minute update, now on Today24.pro

News Every Day

NIN-SIM linkage of all phone numbers completed, says NCC

Today24.pro — latest news 24/7. You can add your news instantly now — here


News Every Day

NIN-SIM linkage of all phone numbers completed, says NCC



Sports today


Новости тенниса
ATP

Шанхай (ATP). 1/4 финала. Медведев сыграет с Синнером, Алькарас – с Махачем



Спорт в России и мире
Москва

Шахматные выходные в ТРЦ «Нора»



All sports news today





Sports in Russia today

Москва

Гандбольный «Зенит» уступил в Москве ЦСКА с разницей в 2 мяча в матче лидеров Суперлиги


Новости России

Game News

Silent Hill 2 players are modding away the fog and freeing the camera to discover hidden detail in the remake


Russian.city



Губернаторы России
ТНТ

Лео Канделаки станет новым ведущим шоу «УТРО.ТНТ»


Пожар произошел на 22 этаже жилого дома в Москве

Аналитик Коротченко: целью учений НАТО является подготовка к ядерной войне

ЧТО ДЕЛАЮТ ПРЯМО НА ТЕННИСНОМ КОРТЕ! ВИДЕО. И очень важные данные для России, США, Европы и всего мира.

Массаж и лечебная физкультура: показания и противопоказания


Продюсер Финкельштейн: Шнуров не отдает 140 млн рублей за отмененный концерт

Поймайте его, если сможете: 17 октября в прокат выходит картина Дмитрия Клепацкого «Схватка»

TMZ: Суд Лос-Анджелеса снял обвинения в убийстве с рэпера Снуп Догга

Мари Краймбрери, Клава Кока, bearwolf и не только! Like FM устраивает звездный девичник


Шанхай (ATP). 1/4 финала. Медведев сыграет с Синнером, Алькарас – с Махачем

Рахимова победила Учижиму и вышла во второй круг турнира WTA в Ухане

Технологическая революция в теннисе: Уимблдон заменит линейных судей на искусственный интеллект с 2025 года

В тени победы Медведева над Циципасом: Эрика Андреева неожиданно выбила младшую сестру Мирру с турнира WTA в Ухани



Гастроэнтеролог Садыков назвал 3 основные причины непреодолимой тяги к сладкому

EVITA BEAUTY STORE - интернет-магазин косметики премиум-класса!

«Театр на Цветном» открывается оригинальной постановкой «12 клоунов в поисках счастья»

Поймайте его, если сможете: 17 октября в прокат выходит картина Дмитрия Клепацкого «Схватка»


Пресейв. Пресейв в музыке. Как сделать Пресейв. Что такое Пресейв в яндекс. Пресейв в Вк. Как сделать Пресейв на Яндексе музыке.

Воробьев после третьего поражения ЦСКА в четырех матчах: «Используем и кнут, и пряник, ищем разные варианты»

Мишустин посмотрел на стенд Чувашии в Москве, передающий аграрную мощь республики

Гандбольный «Зенит» уступил в Москве ЦСКА с разницей в 2 мяча в матче лидеров Суперлиги


Мигрантские дружины и контроль над силовиками: В МВД дали жёсткий ответ на предложение диаспор

Суд арестовал денежные средства экс-мэра Сочи Копайгородского

Рудольф-Родион, бросивший Катерину. Как Юрий Васильев славы не дождался

Вильфанд: зимой в Москве ожидается гололедица и 25-градусные морозы



Путин в России и мире






Персональные новости Russian.city
Игорь Крутой

Игорь Крутой рассказал о возвращении Валерия Леонтьева на сцену



News Every Day

Single family residence sells for $2.4 million in San Jose




Friends of Today24

Музыкальные новости

Персональные новости