Can AI Pass Humanity’s Ultimate Intelligence Test?
A groundbreaking AI benchmark called “Humanity’s Last Exam” is sending ripples through the AI community. Developed by the Center for AI Safety (CAIS) in partnership with Scale AI, it aims to be the ultimate test of whether AI can achieve human-like reasoning, creativity, and problem-solving, the traits that separate true intelligence from mere mimicry.
Humanity’s Last Exam is designed to push the boundaries of what AI can do. It’s a benchmark that challenges AI systems to demonstrate capabilities far beyond traditional tasks, setting a new standard for evaluating AI.
An AI Benchmark Unlike Any Other
Humanity’s Last Exam isn’t about measuring raw computational ability or accuracy in tasks like summarizing articles or identifying images. Instead, it assesses general intelligence and ethical reasoning. The benchmark challenges AI to tackle questions in math, science, and logic while addressing moral dilemmas and the implications of emerging technologies.
“We wanted problems that would test the capabilities of the models at the frontier of human knowledge and reasoning,” explained CAIS co-founder and executive director Dan Hendrycks.
A standout feature of the benchmark is the incorporation of “open world” challenges, where problems lack a single correct answer. For example, AI might analyze hypothetical situations that weigh ethical considerations and predict long-term consequences. This ambitious test pushes AI to demonstrate contextual understanding and judgment.
Is AI Getting Too Smart?
Critics question whether Humanity’s Last Exam overemphasizes human-like traits, sparking debates about its practicality and feeding fears of AI one day surpassing human intelligence. However, its supporters argue that benchmarks like this one are essential for exploring the true capabilities of AI and revealing its limitations. By pushing boundaries, this test offers a crucial glimpse into the future of AI, one that’s fascinating and, for some, a little unsettling. That leaves the question: Is this the key to understanding AI, or are we venturing into territory we’re not ready to face?
What Lies Ahead
The initial trials have already begun, with major players like OpenAI, Anthropic, and Google DeepMind participating. So far, OpenAI’s GPT-4 and o1 models are leading the pack, but none of the AI models have cracked the 50 percent mark… yet. Hendrycks suspects that the AI models’ scores could rise above that by the end of this year. Whether Humanity’s Last Exam will prove to be an insurmountable challenge or the beginning of a new era in artificial general intelligence remains an open question.
Read our reviews of Grok, ChatGPT, and Gemini and judge their intelligence for yourself.
The post Can AI Pass Humanity’s Ultimate Intelligence Test? appeared first on eWEEK.