The end of AI scaling may not be nigh: Here’s what’s next

Share This Post

[ad_1]

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More


As AI systems achieve superhuman performance in increasingly complex tasks, the industry is grappling with whether bigger models are even possible — or if innovation must take a different path.

The general approach to large language model (LLM) development has been that bigger is better, and that performance scales with more data and more computing power. However, recent media discussions have focused on how LLMs are approaching their limits. “Is AI hitting a wall?” The Verge questioned, while Reuters reported that “OpenAI and others seek new path to smarter AI as current methods hit limitations.” 

The concern is that scaling, which has driven advances for years, may not extend to the next generation of models. Reporting suggests that the development of frontier models like GPT-5, which push the current limits of AI, may face challenges due to diminishing performance gains during pre-training. The Information reported on these challenges at OpenAI and Bloomberg covered similar news at Google and Anthropic. 

This issue has led to concerns that these systems may be subject to the law of diminishing returns — where each added unit of input yields progressively smaller gains. As LLMs grow larger, the costs of getting high-quality training data and scaling infrastructure increase exponentially, reducing the returns on performance improvement in new models. Compounding this challenge is the limited availability of high-quality new data, as much of the accessible information has already been incorporated into existing training datasets. 

This does not mean the end of performance gains for AI. It simply means that to sustain progress, further engineering is needed through innovation in model architecture, optimization techniques and data use.

Learning from Moore’s Law

A similar pattern of diminishing returns appeared in the semiconductor industry. For decades, the industry had benefited from Moore’s Law, which predicted that the number of transistors would double every 18 to 24 months, driving dramatic performance improvements through smaller and more efficient designs. This too eventually hit diminishing returns, beginning somewhere between 2005 and 2007 due to Dennard Scaling — the principle that shrinking transistors also reduces power consumption— having hit its limits which fueled predictions of the death of Moore’s Law.

I had a close up view of this issue when I worked with AMD from 2012-2022. This problem did not mean that semiconductors — and by extension computer processors — stopped achieving performance improvements from one generation to the next. It did mean that improvements came more from chiplet designs, high-bandwidth memory, optical switches, more cache memory and accelerated computing architecture rather than the scaling down of transistors.

New paths to progress

Similar phenomena are already being observed with current LLMs. Multimodal AI models like GPT-4o, Claude 3.5 and Gemini 1.5 have proven the power of integrating text and image understanding, enabling advancements in complex tasks like video analysis and contextual image captioning. More tuning of algorithms for both training and inference will lead to further performance gains. Agent technologies, which enable LLMs to perform tasks autonomously and coordinate seamlessly with other systems, will soon significantly expand their practical applications.

Future model breakthroughs might arise from one or more hybrid AI architecture designs combining symbolic reasoning with neural networks. Already, the o1 reasoning model from OpenAI shows the potential for model integration and performance extension. While only now emerging from its early stage of development, quantum computing holds promise for accelerating AI training and inference by addressing current computational bottlenecks.

The perceived scaling wall is unlikely to end future gains, as the AI research community has consistently proven its ingenuity in overcoming challenges and unlocking new capabilities and performance advances. 

In fact, not everyone agrees that there even is a scaling wall. OpenAI CEO Sam Altman was succinct in his views: “There is no wall.”

Source: X https://x.com/sama/status/1856941766915641580 

Speaking on the “Diary of a CEO” podcast, ex-Google CEO and co-author of Genesis Eric Schmidt essentially agreed with Altman, saying he does not believe there is a scaling wall — at least there won’t be one over the next five years. “In five years, you’ll have two or three more turns of the crank of these LLMs. Each one of these cranks looks like it’s a factor of two, factor of three, factor of four of capability, so let’s just say turning the crank on all these systems will get 50 times or 100 times more powerful,” he said.

Leading AI innovators are still optimistic about the pace of progress, as well as the potential for new methodologies. This optimism is evident in a recent conversation on “Lenny’s Podcast” with OpenAI’s CPO Kevin Weil and Anthropic CPO Mike Krieger.

Source: https://www.youtube.com/watch?v=IxkvVZua28k 

In this discussion, Krieger described that what OpenAI and Anthropic are working on today “feels like magic,” but acknowledged that in just 12 months, “we’ll look back and say, can you believe we used that garbage? … That’s how fast [AI development] is moving.” 

It’s true — it does feel like magic, as I recently experienced when using OpenAI’s Advanced Voice Mode. Speaking with ‘Juniper’ felt entirely natural and seamless, showcasing how AI is evolving to understand and respond with emotion and nuance in real-time conversations.

Krieger also discusses the recent o1 model, referring to this as “a new way to scale intelligence, and we feel like we’re just at the very beginning.” He added: “The models are going to get smarter at an accelerating rate.” 

These expected advancements suggest that while traditional scaling approaches may or may not face diminishing returns in the near-term, the AI field is poised for continued breakthroughs through new methodologies and creative engineering.

Does scaling even matter?

While scaling challenges dominate much of the current discourse around LLMs, recent studies suggest that current models are already capable of extraordinary results, raising a provocative question of whether more scaling even matters.

A recent study forecasted that ChatGPT would help doctors make diagnoses when presented with complicated patient cases. Conducted with an early version of GPT-4, the study compared ChatGPT’s diagnostic capabilities against those of doctors with and without AI help. A surprising outcome revealed that ChatGPT alone substantially outperformed both groups, including doctors using AI aid. There are several reasons for this, from doctors’ lack of understanding of how to best use the bot to their belief that their knowledge, experience and intuition were inherently superior.

This is not the first study that shows bots achieving superior results compared to professionals. VentureBeat reported on a study earlier this year which showed that LLMs can conduct financial statement analysis with accuracy rivaling — and even surpassing — that of professional analysts. Also using GPT-4, another goal was to predict future earnings growth. GPT-4 achieved 60% accuracy in predicting the direction of future earnings, notably higher than the 53 to 57% range of human analyst forecasts.

Notably, both these examples are based on models that are already out of date. These outcomes underscore that even without new scaling breakthroughs, existing LLMs are already capable of outperforming experts in complex tasks, challenging assumptions about the necessity of further scaling to achieve impactful results. 

Scaling, skilling or both

These examples show that current LLMs are already highly capable, but scaling alone may not be the sole path forward for future innovation. But with more scaling possible and other emerging techniques promising to improve performance, Schmidt’s optimism reflects the rapid pace of AI advancement, suggesting that in just five years, models could evolve into polymaths, seamlessly answering complex questions across multiple fields. 

Whether through scaling, skilling or entirely new methodologies, the next frontier of AI promises to transform not just the technology itself, but its role in our lives. The challenge ahead is ensuring that progress remains responsible, equitable and impactful for everyone.

Gary Grossman is EVP of technology practice at Edelman and global lead of the Edelman AI Center of Excellence.

DataDecisionMakers

Welcome to the VentureBeat community!

DataDecisionMakers is where experts, including the technical people doing data work, can share data-related insights and innovation.

If you want to read about cutting-edge ideas and up-to-date information, best practices, and the future of data and data tech, join us at DataDecisionMakers.

You might even consider contributing an article of your own!

Read More From DataDecisionMakers


[ad_2]
Source link

Related Posts

- Advertisement -spot_img
LIVE CASINO ONLINESLOT MAHJONG WAYSslot mahjongjudi bolaslot danaslot danaslot danaslot danasabung ayam onlinesabung ayam onlineasianbet77judi bola sbobetmix parlaymix parlaymix parlaysabung ayam onlinelive casinomahjong waysmahjong wayssabung ayam onlineJUDI BOLA ONLINEJUDI BOLA ONLINEJUDI BOLA ONLINESLOT MAHJONG WAYSSLOT MAHJONG WAYSSLOT MAHJONG WAYSJUDI BOLA ONLINEMIX PARLAYSITUS BOLA ONLINEJUDI BOLA ONLINEMIX PARLAYSITUS BOLA ONLINESABUNG AYAM ONLINEJUDI BOLA ONLINEJUDI BOLA ONLINESITUS PARLAYSITUS PARLAYMIX PARLAYMIX PARLAYMIX PARLAYSITUS JUDI BOLAJUDI BOLA ONLINESABUNG AYAM ONLINEJUDI SABUNG AYAMSITUS SABUNG AYAMSV388SBOBET88LIVE CASINO ONLINEMAHJONG WAYS 2SABUNG AYAM ONLINESBOBETlive casino onlinesabung ayam onlineMahjong Ways 2judi bola sbobetslot mahjong wayssabung ayam onlineMahjong Ways 2Agen SBOBETLive Casino Onlinesabung ayam onlineslot danamahjong ways 2sabung ayam onlineslot mahjong gacorjudi bolascatter hitamjudi bolasv388live casinoSabung Ayam OnlineJudi Bola OnlineCasino OnlineMahjong Ways 2Slot777Sabung Ayam OnlineSabung Ayam OnlineJudi Bola OnlineLive Casino OnlineMahjong Ways 2judi bola onlinesabung ayam onlineslot pulsaindobit88indobit88slot gacorCASINO ONLINESLOT ZEUSJUDI BOLA ONLINESABUNG AYAM ONLINESABUNG AYAM ONLINESLOT MAHJONGLIVE CASINOJUDI BOLA ONLINESABUNG AYAM ONLINEJUDI BOLA ONLINE
JUDI BOLA ONLINEMAHJONG WAYS 2SABUNG AYAM ONLINELIVE CASINO ONLINEjudi bola onlinejudi bola onlinesabung ayam onlinesitus toto loginSV388SBOBET WAPBlackjack & BaccaratMahjong WaysSabung Ayam OnlineJudi Bola OnlineAgen SicboSlot Gacor Onlineslot thailandsabung ayam onlinejudi bola onlinejudi bola onlinejudi bola onlinejudi bola onlinejudi bola onlinesabung ayam onlinejudi bola onlineagen live casino onlineslot mahjong ways 2bandar togel onlinesitus live casinosabung ayam onlinepengaruh isu bansos terhadap pola mahjong wayswifi 100 ribu lancar netizen tes kecepatan buat ngulik pola mahjong wayshari guru nasional waktu pas buat ngulik ilmu pola mahjong wayssuperbank resmi ipo strategi investasi dan pola kemenangan mahjong wins 3tiket pesawat turun netizen ikut bahas pola turun naik mahjong wayscuti bersama waktunya rehat dan ngulik analogi kemenangan mahjong wins 3Hongkong PoolsMahjong WaysLive Casino OnlineSabung Ayam OnlineJudi Online
judi bola onlinejudi bola onlinesabung ayam onlinelive casino onlinejudi bola onlinejudi bola onlinejuara303juara303juara303juara303juara303juara303juara303juara303SV388Mix ParlayLive Casino OnlineSlot GacorSabung Ayam OnlineMix ParlayAgen BlackjackPRAGMATIC PLAYsabung ayam onlinejudi bola onlinesabung ayam onlinejudi bola onlineslot mahjong wayssabung ayam onlinejudi bola onlineslot mahjong wayssabung ayam onlinejudi bola onlineslot mahjong ways 2sabung ayam onlinejudi bola onlineagen live casino onlinebandar togel onlinesabung ayam onlinejudi bolasabung ayam onlinejudi bolasabung ayam onlinehari guru nasional bikin semangat belajar termasuk pahami pola mahjong waysdinamika gempa blitar magnitudo dan fenomena pola yang berguncang mahjong ways
Slot Mahjong Gacorsabung ayam onlinejudi bolalive casinoindobit88judi bolaslot mahjong gacorslot pulsajudi bolalive casino onlinesabung ayam onlinemahjong ways 2sbobetsv388slot zeussabung ayam onlinesitus judi bolaMahjong Ways 2situs judi bolasitus live casinosabung ayam onlinejudi bolapoker onlineindobit88Sabung Ayam OnlineJudi Bola OnlineCasino OnlineSlot777Sabung Ayam OnlineJudi Bola OnlineLive Casino OnlineMahjong Ways 2judi bolajudi bolasv388judi bolajudi bola onlineslot depo 10kcasino onlinesabung ayam onlinejudi bola onlinejudi bola onlinejudi bola onlinelive casino onlinesabung ayam onlinesv388sbobet88casino onlinescatter hitamsabung ayam onlinemix parlay sbobetlive casino onlinezeus slotSV388Bandar Judi BolaDream GamingMahjong Ways 2Wala MeronMix ParlayPokerSlot Mahjongmahjong ways 2sabung ayam onlinemahjong ways 2mahjong ways 2sabung ayam onlinesabung ayam onlinesabung ayam onlinejudi bola onlinejudi bola onlineagen live casino onlinesitus live casino onlinesitus live casinosabung ayam onlinejudi bola onlinekajian pola mahjong ways dalam konteks pembelajaran hari guruketerkaitan tren harga emas antam dengan pola mahjong wayspola perubahan harga bbm pertamina ke dinamika mahjong waysjudi bolajudi bolajudi bolajudi bolasabung ayam onlinesabung ayam onlinesabung ayam onlinesabung ayam online
JUDI BOLA ONLINEMAHJONG WAYS 2SABUNG AYAM ONLINELIVE CASINO ONLINEMAHJONG WAYSjudi bola onlinejudi bola onlinejudi bola onlinesabung ayam onlinejudi bola onlinesabung ayam onlinejudi bola onlinelive casino onlineslot mahjong waysjuara303juara303juara303juara303juara303juara303juara303juara303Sabung Ayam OnlineMix ParlayBandar Casino OnlineMahjong WaysWala MeronJudi BolaPokerSlot Mahjongjudi bola onlinejudi bola onlinesabung ayam onlinejudi bola onlineSLOT MAHJONGmahjong ways 2judi bolamahjong ways 2sabung ayam onlinetosayama academy workshopsabung ayam onlinejudi bola onlinesitus live casino onlinesabung ayam onlinejudi bola onlineagen live casino onlineimplementasi logika analisis bmkg dalam membaca tren mahjong wayscloudflare jadi faktor mudahnya menang di permainan mahjong wayssiswa srma 44 minahasa memahami probabilitas melalui pola digital mahjong wayspola mahjong ways bisa bikin untung besar walaupun harga emas jatuhgunung semeru erupsi bikin geger tetapi pola majong ways lebih bikin dagdigdugsabung ayam onlinesabung ayam onlinesabung ayam onlinesabung ayam onlinesabung ayam online
judi bolaslot pulsaslot pulsaslot gacor mahjongsabung ayam onlinelive casino onlineindobit88judi bolasv388judi bolaMAHJONG WAYS 2LIVE CASINOJUDI BOLA ONLINESABUNG AYAM ONLINEmix parlaysabung ayam onlinelive casinomahjong waysmix parlaysabung ayam onlinelive casinomahjong wayssabung ayam onlinesabung ayam onlinemix parlaysabung ayam onlinelive casinomahjong waysmix parlaysabung ayam onlinelive casinomahjong waysmix parlaymahjong slotSABUNG AYAM ONLINESITUS LIVE CASINO ONLINESLOT MAHJONGSLOT777SLOT MAHJONGSLOT THAILANDJUDI BOLA ONLINESABUNG AYAM ONLINESABUNG AYAM ONLINESABUNG AYAM ONLINESLOT MAHJONG WAYSSLOT MAHJONG WAYSSITUS JUDI BOLAJUDI BOLA ONLINELIVE CASINO ONLINESLOT KAKEK ZEUSMIX PARLAYSABUNG AYAM ONLINESLOT MAHJONG WAYSSABUNG AYAM ONLINEjudi bolaagen baccaratsv388Slot Mahjong Gacorlive casinosv388
Mahjong Ways 2mahjong ways 2daftar dan login wahanabetCapWorks Official ContactAynsley Official SitedexelTienda de antigüedades y muebles rústicos会社概要 / Company ProfileHarifuku Clinic Official AccessNusa Islands Bali Official PackagesTrinidad and Tobago Pilots’ Association Official About Pagekuasai pola rtp pragmatic playlangkah mendapatkan scatter emaspola rtp pg soft indojawa88Green Gold Mountain Official SiteKomite SMKN 1 Tanjung Jabung Barat Official Sitetutorial maxwin mahjong waysstrategi rtp mahjong waysEIKON Official Policieskontak situs pecinta ayamNusa Islands Bali Official ContactCitraLand Surabaya Official NewsLenterakita About PageVinayak Group Official SiteI Think An Idea Official SitePITAC Official SitePortfolioSitez Official SiteMedical LTD Official SiteCapworks Official SiteMartino & Luth Official SiteTech With Mike First Official SiteSahabat Tiopan Official SiteE-Sekolah CBT Official SiteBDF Ventura Official SiteOcean E Soft Official SiteArab DMC Official SiteBBC Noun Official SiteCang Vu Hai Phong Official SiteThe Flat Official SiteThe Black Sheep Official SiteCEM Argentina Official SiteSlot MahjongTop Dawg Tavern Official SiteKelas Nesfatin Official SiteDuhoc Interlink Official SiteKarunia Inda Med Mandiri Official SiteJFV Pulm Official SiteRatiohead Official SiteAskona Official SiteMAN Surabaya E-Learning Official SiteShaker Group Official SiteTakaKawa Shoten Official SiteBrydan Solutions Official SiteConcursos Rodin Official SiteEHOB Official SiteConmou Official SiteCareer Wings Official SiteMontero Espinosa Official SiteBDF Ventura Official SiteDesa Sangginora Official SiteBDF Ventura Official SiteTaruna Akademia Official SiteAkura Official SiteMUI Ciamis Official SiteNamulanda Technical Institute Official Site