Will A.I. Soon Outsmart Humans? Play This Puzzle to Find Out.

Share This Post

[ad_1]

In 2019, an A.I. researcher, François Chollet, designed a puzzle game that was meant to be easy for humans but hard for machines.

The game, called ARC, became an important way for experts to track the progress of artificial intelligence and push back against the narrative that scientists are on the brink of building A.I. technology that will outsmart humanity.

Mr. Chollet’s colorful puzzles test the ability to quickly identify visual patterns based on just a few examples. To play the game, you look closely at the examples and try to find the pattern.

Each example uses the pattern to transform a grid of colored squares into a new grid of colored squares:

The pattern is the same for every example.

Now, fill in the new grid by applying the pattern you learned in the examples above.

For years, these puzzles proved to be nearly impossible for artificial intelligence, including chatbots like ChatGPT.

A.I. systems typically learned their skills by analyzing huge amounts of data culled from across the internet. That meant they could generate sentences by repeating concepts they had seen a thousand times before. But they couldn’t necessarily solve new logic puzzles after seeing only a few examples.

That is, until recently. In December, OpenAI said that its latest A.I. system, called OpenAI o3, had surpassed human performance on Mr. Chollet’s test. Unlike the original version of ChatGPT, o3 was able to spend time considering different possibilities before responding.

Some saw it as proof that A.I. systems were approaching artificial general intelligence, or A.G.I., which describes a machine that’s as smart as a human. Mr. Chollet had created his puzzles as a way of showing that machines were still a long way from this ambitious goal.

But the news also exposed the weaknesses in benchmark tests like ARC, short for Abstraction and Reasoning Corpus. For decades, researchers have set up milestones to track A.I.’s progress. But once these milestones were reached, they were exposed as insufficient measures of true intelligence.

Arvind Narayanan, a Princeton computer science professor and co-author of the book “AI Snake Oil,” said that any claim that the ARC test measured progress toward A.G.I. was “very much iffy.”

Still, Mr. Narayanan acknowledged that OpenAI’s technology demonstrated impressive skills in passing the ARC test. Some of the puzzles are not as easy as the one you just tried.

The one below is little harder, and it, too, was correctly solved by OpenAI’s new A.I. system:

A puzzle like this shows that OpenAI’s technology is getting better at working through logic problems. But the average person can solve puzzles like this one in seconds. OpenAI’s technology consumed significant computing resources to pass the test.

Last June, Mr. Chollet teamed up with Mike Knoop, co-founder of the software company Zapier, to create what they called the ARC Prize. The pair financed a contest that promised $1 million to anyone who built an A.I. system that exceeded human performance on the benchmark, which they renamed “ARC-AGI.”

Companies and researchers submitted over 1,400 A.I. systems, but no one won the prize. All scored below 85 percent, which marked the performance of a “smart” human.

OpenAI’s o3 system correctly answered 87.5 percent of the puzzles. But the company ran afoul of competition rules because it spent nearly $1.5 million in electricity and computing costs to complete the test, according to pricing estimates.

OpenAI was also ineligible for the ARC Prize because it was not willing to publicly share the technology behind its A.I. system through a practice called open sourcing. Separately, OpenAI ran a “high-efficiency” variant of o3 that scored 75.7 percent on the test and cost less than $10,000.

“Intelligence is efficiency. And with these models, they are very far from human-level efficiency,” Mr. Chollet said.

(The New York Times sued OpenAI and its partner, Microsoft, in 2023 for copyright infringement of news content related to A.I. systems.)

On Monday, the ARC Prize introduced a new benchmark, ARC-AGI-2, with hundreds of additional tasks. The puzzles are in the same colorful, grid-like game format as the original benchmark, but are more difficult.

“It’s going to be harder for humans, still very doable,” said Mr. Chollet. “It will be much, much harder for A.I. — o3 is not going to be solving ARC-AGI-2.”

Here is a puzzle from the new ARC-AGI-2 benchmark that OpenAI’s system tried and failed to solve. Remember, the same pattern applies to all the examples.

Now try to fill in the grid below according to the pattern you found in the examples:

This shows that although A.I. systems are better at dealing with problems they have never seen before, they still struggle.

Here are a few additional puzzles from ARC-AGI-2, which focuses on problems that require multiple steps of reasoning:

As OpenAI and other companies continue to improve their technology, they may pass the new version of ARC. But that does not mean that A.G.I. will be achieved.

Judging intelligence is subjective. There are countless intangible indicators of intelligence, from composing works of art to navigating moral dilemmas to intuiting emotions.

Companies like OpenAI have built chatbots that can answer questions, write poetry and even solve logic puzzles. In some ways, they have already exceeded the powers of the brain. OpenAI’s technology has outperformed its chief scientist, Jakub Pachocki, on a competitive programming test.

But these systems still make mistakes that the average person would never make. And they struggle to do simple things that humans can handle.

“You’re loading the dishwasher, and your dog comes over and starts licking the dishes. What do you do?” said Melanie Mitchell, a professor in A.I. at the Santa Fe Institute. “We sort of know how to do that, because we know all about dogs and dishes and all that. But would a dishwashing robot know how to do that?”

To Mr. Chollet, the ability to efficiently acquire new skills is something that comes naturally to humans but is still lacking in A.I. technology. And it’s what he has been targeting with the ARC-AGI benchmarks.

In January, the ARC Prize became a nonprofit foundation that serves as a “north star for A.G.I.” The ARC Prize team expects ARC-AGI-2 to last for about two years before it is solved by A.I. technology — though they would not be surprised if it happened sooner.

They have already started work on ARC-AGI-3, which they hope to debut in 2026. An early mock-up hints at a puzzle that involves interacting with a dynamic, grid-based game.

A.I. researcher François Chollet designed a puzzle game meant to be easy for humans but hard for machines.

Kelsey McClellan for The New York Times

Early mock-up for ARC-AGI-3, a benchmark that could involve interacting with a dynamic, grid-based game.

ARC Prize Foundation

This is a step closer to what people deal with in the real world — a place filled with movement. It does not stand still like the puzzles you tried above.

Even this, however, will go only part of the way toward showing when machines have surpassed the brain. Humans navigate the physical world — not just the digital. The goal posts will continue to shift as A.I. advances.

“If it’s no longer possible for people like me to produce benchmarks that measure things that are easy for humans but impossible for A.I.,” Mr. Chollet said, “then you have A.G.I.”

[ad_2]

Source link

Related Posts

- Advertisement -spot_img
kelola uang bansos 900 ribu seperti baca pola di mahjong waysPemain mencari pola mahjong ways di tengah banjir sibolgaslotter bandingkan kejutan gol liga champions dan pola mahjong wins 3tren perbincangan mahjong ways meningkat memasuki musim cuti desemberpemain gunakan ramalan shio untuk gambarkan peruntungan di mahjong wins 3LIVE CASINO ONLINESLOT MAHJONG WAYSslot mahjongjudi bolaslot danaslot danaslot danaslot danasabung ayam onlinesabung ayam onlineasianbet77judi bola sbobetmix parlaymix parlaymix parlaysabung ayam onlinelive casinomahjong waysmahjong wayssabung ayam onlineJUDI BOLA ONLINEJUDI BOLA ONLINEJUDI BOLA ONLINESLOT MAHJONG WAYSSLOT MAHJONG WAYSSLOT MAHJONG WAYSJUDI BOLA ONLINEMIX PARLAYSITUS BOLA ONLINEJUDI BOLA ONLINEMIX PARLAYSITUS BOLA ONLINESABUNG AYAM ONLINEJUDI BOLA ONLINEJUDI BOLA ONLINESITUS PARLAYSITUS PARLAYMIX PARLAYMIX PARLAYMIX PARLAYSITUS JUDI BOLAJUDI BOLA ONLINESABUNG AYAM ONLINEJUDI SABUNG AYAMSITUS SABUNG AYAMSV388SBOBET88LIVE CASINO ONLINEMAHJONG WAYS 2SABUNG AYAM ONLINESBOBETlive casino onlinesabung ayam onlineMahjong Ways 2judi bola sbobetslot mahjong wayssabung ayam onlineMahjong Ways 2Agen SBOBETLive Casino Onlinesabung ayam onlineslot danamahjong ways 2sabung ayam onlineslot mahjong gacorjudi bolascatter hitamjudi bolasv388live casinoSabung Ayam OnlineJudi Bola OnlineCasino OnlineMahjong Ways 2Slot777Sabung Ayam OnlineSabung Ayam OnlineJudi Bola OnlineLive Casino OnlineMahjong Ways 2judi bola onlinesabung ayam onlineslot pulsaindobit88indobit88slot gacorCASINO ONLINESLOT ZEUSJUDI BOLA ONLINESABUNG AYAM ONLINESABUNG AYAM ONLINESLOT MAHJONGLIVE CASINOJUDI BOLA ONLINESABUNG AYAM ONLINEJUDI BOLA ONLINE
JUDI BOLA ONLINEMAHJONG WAYS 2SABUNG AYAM ONLINELIVE CASINO ONLINEjudi bola onlinejudi bola onlinesabung ayam onlinesitus toto loginSV388SBOBET WAPBlackjack & BaccaratMahjong WaysSabung Ayam OnlineJudi Bola OnlineAgen SicboSlot Gacor Onlineslot thailandsabung ayam onlinejudi bola onlinejudi bola onlinejudi bola onlinejudi bola onlinejudi bola onlinesabung ayam onlinejudi bola onlineagen live casino onlineslot mahjong ways 2bandar togel onlinesitus live casinosabung ayam onlinepengaruh isu bansos terhadap pola mahjong wayswifi 100 ribu lancar netizen tes kecepatan buat ngulik pola mahjong wayshari guru nasional waktu pas buat ngulik ilmu pola mahjong wayssuperbank resmi ipo strategi investasi dan pola kemenangan mahjong wins 3tiket pesawat turun netizen ikut bahas pola turun naik mahjong wayscuti bersama waktunya rehat dan ngulik analogi kemenangan mahjong wins 3Hongkong PoolsMahjong WaysLive Casino OnlineSabung Ayam OnlineJudi Online
judi bola onlinejudi bola onlinesabung ayam onlinelive casino onlinejudi bola onlinejudi bola onlinejuara303juara303juara303juara303juara303juara303juara303juara303SV388Mix ParlayLive Casino OnlineSlot GacorSabung Ayam OnlineMix ParlayAgen BlackjackPRAGMATIC PLAYsabung ayam onlinejudi bola onlinesabung ayam onlinejudi bola onlineslot mahjong wayssabung ayam onlinejudi bola onlineslot mahjong wayssabung ayam onlinejudi bola onlineslot mahjong ways 2sabung ayam onlinejudi bola onlineagen live casino onlinebandar togel onlinesabung ayam onlinejudi bolasabung ayam onlinejudi bolasabung ayam onlinehari guru nasional bikin semangat belajar termasuk pahami pola mahjong waysdinamika gempa blitar magnitudo dan fenomena pola yang berguncang mahjong ways
Slot Mahjong Gacorsabung ayam onlinejudi bolalive casinoindobit88judi bolaslot mahjong gacorslot pulsajudi bolalive casino onlinesabung ayam onlinemahjong ways 2sbobetsv388slot zeussabung ayam onlinesitus judi bolaMahjong Ways 2situs judi bolasitus live casinosabung ayam onlinejudi bolapoker onlineindobit88Sabung Ayam OnlineJudi Bola OnlineCasino OnlineSlot777Sabung Ayam OnlineJudi Bola OnlineLive Casino OnlineMahjong Ways 2judi bolajudi bolasv388judi bolajudi bola onlineslot depo 10kcasino onlinesabung ayam onlinejudi bola onlinejudi bola onlinejudi bola onlinelive casino onlinesabung ayam onlinesv388sbobet88casino onlinescatter hitamsabung ayam onlinemix parlay sbobetlive casino onlinezeus slotSV388Bandar Judi BolaDream GamingMahjong Ways 2Wala MeronMix ParlayPokerSlot Mahjongmahjong ways 2sabung ayam onlinemahjong ways 2mahjong ways 2sabung ayam onlinesabung ayam onlinesabung ayam onlinejudi bola onlinejudi bola onlineagen live casino onlinesitus live casino onlinesitus live casinosabung ayam onlinejudi bola onlinekajian pola mahjong ways dalam konteks pembelajaran hari guruketerkaitan tren harga emas antam dengan pola mahjong wayspola perubahan harga bbm pertamina ke dinamika mahjong waysjudi bolajudi bolajudi bolajudi bolasabung ayam onlinesabung ayam onlinesabung ayam onlinesabung ayam online
JUDI BOLA ONLINEMAHJONG WAYS 2SABUNG AYAM ONLINELIVE CASINO ONLINEMAHJONG WAYSjudi bola onlinejudi bola onlinejudi bola onlinesabung ayam onlinejudi bola onlinesabung ayam onlinejudi bola onlinelive casino onlineslot mahjong waysjuara303juara303juara303juara303juara303juara303juara303juara303Sabung Ayam OnlineMix ParlayBandar Casino OnlineMahjong WaysWala MeronJudi BolaPokerSlot Mahjongjudi bola onlinejudi bola onlinesabung ayam onlinejudi bola onlineSLOT MAHJONGmahjong ways 2judi bolamahjong ways 2sabung ayam onlinetosayama academy workshopsabung ayam onlinejudi bola onlinesitus live casino onlinesabung ayam onlinejudi bola onlineagen live casino onlineimplementasi logika analisis bmkg dalam membaca tren mahjong wayscloudflare jadi faktor mudahnya menang di permainan mahjong wayssiswa srma 44 minahasa memahami probabilitas melalui pola digital mahjong wayspola mahjong ways bisa bikin untung besar walaupun harga emas jatuhgunung semeru erupsi bikin geger tetapi pola majong ways lebih bikin dagdigdugsabung ayam onlinesabung ayam onlinesabung ayam onlinesabung ayam onlinesabung ayam online
judi bolaslot pulsaslot pulsaslot gacor mahjongsabung ayam onlinelive casino onlineindobit88judi bolasv388judi bolaMAHJONG WAYS 2LIVE CASINOJUDI BOLA ONLINESABUNG AYAM ONLINEmix parlaysabung ayam onlinelive casinomahjong waysmix parlaysabung ayam onlinelive casinomahjong wayssabung ayam onlinesabung ayam onlinemix parlaysabung ayam onlinelive casinomahjong waysmix parlaysabung ayam onlinelive casinomahjong waysmix parlaymahjong slotSABUNG AYAM ONLINESITUS LIVE CASINO ONLINESLOT MAHJONGSLOT777SLOT MAHJONGSLOT THAILANDJUDI BOLA ONLINESABUNG AYAM ONLINESABUNG AYAM ONLINESABUNG AYAM ONLINESLOT MAHJONG WAYSSLOT MAHJONG WAYSSITUS JUDI BOLAJUDI BOLA ONLINELIVE CASINO ONLINESLOT KAKEK ZEUSMIX PARLAYSABUNG AYAM ONLINESLOT MAHJONG WAYSSABUNG AYAM ONLINEjudi bolaagen baccaratsv388Slot Mahjong Gacorlive casinosv388
Mahjong Ways 2mahjong ways 2daftar dan login wahanabetCapWorks Official ContactAynsley Official SitedexelTienda de antigüedades y muebles rústicos会社概要 / Company ProfileHarifuku Clinic Official AccessNusa Islands Bali Official PackagesTrinidad and Tobago Pilots’ Association Official About Pagekuasai pola rtp pragmatic playlangkah mendapatkan scatter emaspola rtp pg soft indojawa88Green Gold Mountain Official SiteKomite SMKN 1 Tanjung Jabung Barat Official Sitetutorial maxwin mahjong waysstrategi rtp mahjong waysEIKON Official Policieskontak situs pecinta ayamNusa Islands Bali Official ContactCitraLand Surabaya Official NewsLenterakita About PageVinayak Group Official SiteI Think An Idea Official SitePITAC Official SitePortfolioSitez Official SiteMedical LTD Official SiteCapworks Official SiteMartino & Luth Official SiteTech With Mike First Official SiteSahabat Tiopan Official SiteE-Sekolah CBT Official SiteBDF Ventura Official SiteOcean E Soft Official SiteArab DMC Official SiteBBC Noun Official SiteCang Vu Hai Phong Official SiteThe Flat Official SiteThe Black Sheep Official SiteCEM Argentina Official SiteSlot MahjongTop Dawg Tavern Official SiteKelas Nesfatin Official SiteDuhoc Interlink Official SiteKarunia Inda Med Mandiri Official SiteJFV Pulm Official SiteRatiohead Official SiteAskona Official SiteMAN Surabaya E-Learning Official SiteShaker Group Official SiteTakaKawa Shoten Official SiteBrydan Solutions Official SiteConcursos Rodin Official SiteEHOB Official SiteConmou Official SiteCareer Wings Official SiteMontero Espinosa Official SiteBDF Ventura Official SiteDesa Sangginora Official SiteBDF Ventura Official SiteTaruna Akademia Official SiteAkura Official SiteMUI Ciamis Official SiteNamulanda Technical Institute Official Site