How AI models are getting smarter

Share This Post


All these things are powered by artificial-intelligence (AI) models. Most rely on a neural network, trained on massive amounts of information—text, images and the like—relevant to how it will be used. Through much trial and error the weights of connections between simulated neurons are tuned on the basis of these data, akin to adjusting billions of dials until the output for a given input is satisfactory.

There are many ways to connect and layer neurons into a network. A series of advances in these architectures has helped researchers build neural networks which can learn more efficiently and which can extract more useful findings from existing datasets, driving much of the recent progress in AI.

Most of the current excitement has been focused on two families of models: large language models (LLMs) for text, and diffusion models for images. These are deeper (ie, have more layers of neurons) than what came before, and are organised in ways that let them churn quickly through reams of data.

LLMs—such as GPT, Gemini, Claude and Llama—are all built on the so-called transformer architecture. Introduced in 2017 by Ashish Vaswani and his team at Google Brain, the key principle of transformers is that of “attention”. An attention layer allows a model to learn how multiple aspects of an input—such as words at certain distances from each other in text—are related to each other, and to take that into account as it formulates its output. Many attention layers in a row allow a model to learn associations at different levels of granularity—between words, phrases or even paragraphs. This approach is also well-suited for implementation on graphics-processing unit (GPU) chips, which has allowed these models to scale up and has, in turn, ramped up the market capitalisation of Nvidia, the world’s leading GPU-maker.

Transformer-based models can generate images as well as text. The first version of DALL-E, released by OpenAI in 2021, was a transformer that learned associations between groups of pixels in an image, rather than words in a text. In both cases the neural network is translating what it “sees” into numbers and performing maths (specifically, matrix operations) on them. But transformers have their limitations. They struggle to learn consistent world-models. For example, when fielding a human’s queries they will contradict themselves from one answer to the next, without any “understanding” that the first answer makes the second nonsensical (or vice versa), because they do not really “know” either answer—just associations of certain strings of words that look like answers.

And as many now know, transformer-based models are prone to so-called “hallucinations” where they make up plausible-looking but wrong answers, and citations to support them. Similarly, the images produced by early transformer-based models often broke the rules of physics and were implausible in other ways (which may be a feature for some users, but was a bug for designers who sought to produce photo-realistic images). A different sort of model was needed.

Not my cup of tea

Enter diffusion models, which are capable of generating far more realistic images. The main idea for them was inspired by the physical process of diffusion. If you put a tea bag into a cup of hot water, the tea leaves start to steep and the colour of the tea seeps out, blurring into clear water. Leave it for a few minutes and the liquid in the cup will be a uniform colour. The laws of physics dictate this process of diffusion. Much as you can use the laws of physics to predict how the tea will diffuse, you can also reverse-engineer this process—to reconstruct where and how the tea bag might first have been dunked.In real life the second law of thermodynamics makes this a one-way street; one cannot get the original tea bag back from the cup. But learning to simulate that entropy-reversing return trip makes realistic image-generation possible.

Training works like this. You take an image and apply progressively more blur and noise, until it looks completely random. Then comes the hard part: reversing this process to recreate the original image, like recovering the tea bag from the tea. This is done using “self-supervised learning”, similar to how LLMs are trained on text: covering up words in a sentence and learning to predict the missing words through trial and error. In the case of images, the network learns how to remove increasing amounts of noise to reproduce the original image. As it works through billions of images, learning the patterns needed to remove distortions, the network gains the ability to create entirely new images out of nothing more than random noise.


View Full Image

Graphic: The Economist

Most state-of-the-art image-generation systems use a diffusion model, though they differ in how they go about “de-noising” or reversing distortions. Stable Diffusion (from Stability AI) and Imagen, both released in 2022, used variations of an architecture called a convolutional neural network (CNN), which is good at analysing grid-like data such as rows and columns of pixels. CNNs, in effect, move small sliding windows up and down across their input looking for specific artefacts, such as patterns and corners. But though CNNs work well with pixels, some of the latest image-generators use so-called diffusion transformers, including Stability AI’s newest model, Stable Diffusion 3. Once trained on diffusion, transformers are much better able to grasp how various pieces of an image or frame of video relate to each other, and how strongly or weakly they do so, resulting in more realistic outputs (though they still make mistakes).

Recommendation systems are another kettle of fish. It is rare to get a glimpse at the innards of one, because the companies that build and use recommendation algorithms are highly secretive about them. But in 2019 Meta, then Facebook, released details about its deep-learning recommendation model (DLRM). The model has three main parts. First, it converts inputs (such as a user’s age or “likes” on the platform, or content they consumed) into “embeddings”. It learns in such a way that similar things (like tennis and ping pong) are close to each other in this embedding space.

The DLRM then uses a neural network to do something called matrix factorisation. Imagine a spreadsheet where the columns are videos and the rows are different users. Each cell says how much each user likes each video. But most of the cells in the grid are empty. The goal of recommendation is to make predictions for all the empty cells. One way a DLRM might do this is to split the grid (in mathematical terms, factorise the matrix) into two grids: one that contains data about users, and one that contains data about the videos. By recombining these grids (or multiplying the matrices) and feeding the results into another neural network for more number-crunching, it is possible to fill in the grid cells that used to be empty—ie, predict how much each user will like each video.

The same approach can be applied to advertisements, songs on a streaming service, products on an e-commerce platform, and so forth. Tech firms are most interested in models that excel at commercially useful tasks like this. But running these models at scale requires extremely deep pockets, vast quantities of data and huge amounts of processing power.

Wait until you see next year’s model

In academic contexts, where datasets are smaller and budgets are constrained, other kinds of models are more practical. These include recurrent neural networks (for analysing sequences of data), variational autoencoders (for spotting patterns in data), generative adversarial networks (where one model learns to do a task by repeatedly trying to fool another model) and graph neural networks (for predicting the outcomes of complex interactions).

Just as deep neural networks, transformers and diffusion models all made the leap from research curiosities to widespread deployment, features and principles from these other models will be seized upon and incorporated into future AI models. Transformers are highly efficient, but it is not clear that scaling them up can solve their tendencies to hallucinate and to make logical errors when reasoning. The search is already under way for “post-transformer” architectures, from “state-space models” to “neuro-symbolic” AI, that can overcome such weaknesses and enable the next leap forward. Ideally such an architecture would combine attention with greater prowess at reasoning. Right now no human yet knows how to build that kind of model. Maybe someday an AI model will do the job.

© 2024, The Economist Newspaper Limited. All rights reserved. From The Economist, published under licence. The original content can be found on www.economist.com



Source link

Related Posts

- Advertisement -spot_img
SBOBETJUDI BOLA ONLINEMIX PARLAYSBOBET88JUDI BOLA ONLINESABUNG AYAM ONLINESLOT MAHJONGLIVE CASINO ONLINEsabung ayam onlinelive casino onlineslot mahjong waysjudi bola onlineini alasan mengapa mahjong ways 2 selalu viral pg soft sering kasih kemenangan besar dengan banyak bocoran pola & triktanpa pola gak perlu rtp taktik akurat spin mahjong wins 3 terbukti auto scatter hitamtrik jitu pahami algoritma mahjong ways tutorial scatter selayar bersama admin wahanabetbagaimana sih cara cepat pahami rotasi simbol mahjong ways simak disini tips dapat cuan besar langsung dari admin pg softmain cerdas hindari nafsu spin mahjong ways 2 di wahanabet cuma sekali deposit wede terus tiap haribahaya rungkat mengintai simak guide cuan mahjong ways 2 langsung cs wahanabet 24 jamtanpa liat rtp bisa auto cuan jika ikuti tips dan trik dari wahanabet di mahjong waysbocoran pola terbaik dari wahanabet yang cuman diberikan ke pemain baru di mahjong waysrahasia cuan mahjong ways 2 ala wahanabet cukup main 3 menit ajabocoran grand master mahjong ways 2 rahasia spin santai auto maxwindari hiburan ringan bermain mahjong wins meraih kekayaan instan di wahanabetcara pemain pro mengatur tempo spin mahjong ways agar konsisten solusi untuk semua pemainSV388GA28WS168Sabung Ayam OnlineDragon TigerSBOBET88Judi Bola OnlineLive Casino OnlineSlot MahjongSabung Ayam OnlineTogel Onlinerahasia ciptakan pola starlight princessterungkap strategi sweet bonanzapola rahasia wild banditopola maxwin gates of gatot kacascatter hitam mahjong wins 3bocoran scatter mahjong ways 2scatter ganda wild bounty showdownslot777judi bolajudi bolasabung ayam online Sabung Ayam sbobet mix parlaySv388Slot thailandJudi Bola Onlinekombinasi scatter dan fitur bonus mahjong wins 3rahasia cuan mahjong ways 3 trik mudah menang bikin pemula langsung jadi sultan
asianbet77judi bolajudi bolaasianbet77asianbet77asianbet77SABUNG AYAM ONLINESV388WS168JUDI BOLA ONLINEMIX PARLAYSBOBETJUDI BOLASABUNG AYAM ONLINESLOT MAHJONGLIVE CASINO ONLINEmix parlaymix parlaymix parlaysabung ayam onlinemix parlaylive casinomix parlaysabung ayam onlinelive casinomahjongsabung ayam onlinesabung ayam onlinesabung ayam onlinejudi bolajudi bolaSITUS SLOT ONLINEjudi bolalive casino onlinesabung ayam onlineslot gacor mahjongsabung ayam onlinejudi bola onlinelive casino onlineslot mahjong wayssabung ayam onlinejudi bola onlinelive casino onlineslot mahjong wayssabung ayam onlinejudi bola onlinelive casino onlineslot mahjong waysindobit88pola ampuh bermainsbobet88live casinopola mahjongpola mahjong terbarujudi bola onlinesabung ayam onlinelive casino onlinelive casino onlinesabung ayam onlinelive casino onlineindobit88blackjack onlinesabung ayam onlinejudi bola onlineCasino OnlineMahjong Waysjudi bola onlinesabung ayam onlinecasino OnlineMahjong Waysjudi bola onlinesabung ayam onlinemahjong ways 2sabung ayam onlinesbobet88mahjong ways 2mahjong wins 3gates of olympusstarlight princesssweet bonanzasbobetsv388agen live casinosabung ayam onlinejudi bola onlineagen casino onlinejudi bola onlinesabung ayam onlinesabung ayam onlinejudi bola onlinemahjong waysmahjong winsgates of olympusstarlight princesssweet bonanzasbobetsv388agen live casinocasino onlinecasino onlinesabung ayam onlinejudi bola onlinejudi bola online
algoritma spin mahjong ways beradu di layar digitalspin berirama untuk hasil maksimal dan stabil mahjopanduan lengkap cuan mahjong ways 2langkah mudah membaca momentum scatter hitamtrik maxwin gates of gatot kacascatter hitam mahjong waysSBOBET88JUDI BOLA ONLINESBOBETMIX PARLAYjudi bola onlinesabung ayam onlinelive casino onlineslot mahjong waysjudi bola onlinetetap optimis cuan mahjong ways 2 pasti legit pakai pola ini scatter pecah beruntun trik main dituntuntrik wede elit mahjong wins 3 pola anti sulit scatter hitam pecah gak pelit kasih cuan auto legittampilan warna-warni cuan berseri tips ampuh sweet bonanza anti rip scatter x1000 pecah beruntun di pola iniscatter mulus wede lancar jurus pamungkas gates of olympus petir x1000 pecah terus ini triknyapahami & kuasai trik jitu mahjong ways 3 prediksi rotasi kombinasi simbol scatter wild tutor admin wahanabettips mudah cuan mahjong ways bocoran pola pro player menang rp.150.330.001 cuma modal 100 ribu scatter pecah dimenit ke-5mahjong ways di wahanabet berikan tips dan trik memanfaatkan momenttrik baca pola jam yang pas dalam bermain mahjong ways versi wahanabetlakukan trik ini sebelum bermain di mahjong ways dijamin cukup 8x spin naga hitam terpanggilbocoran trik untuk pemain baru di mahjong wins 3 9 dari 10 pemain baru maxwin besarkecewa kalah di mahjong ways sweet bonanza kasih solusi pasti lewat bocoran tips dan trik dari wahanabetanti rungkad di sweet bonanza jika kalian ikuti tips dari wahanabetSV388SBOBET88LIVE CASINO ONLINESCATTER HITAMSABUNG AYAM ONLINEMIX PARLAY SBOBETCASINO ONLINEZEUS SLOTMix ParlaySabung Ayam OnlineSabung Ayam OnlineLive Casino OnlineSabung Ayam OnlineSabung Ayam OnlineLive Casino OnlineSlot MahjongJudi Bola OnlineSabung Ayam OnlineSabung Ayam Onlineguru honorer maxwin mahjong wayskombinasi pola gates of olympusteknik spin turbo gates of olympusrahasia pola mahjong wins 3trik maxwin gates of gatot kacascatter hitam mahjong waysMonaco Rugby 7s Official Contactsabung ayam onlinesabung ayam onlinesabung ayam onlinesabung ayam online
menang konsisten di wild bounty showdownrahasia wild dan scatter mahjong wins 3cara unik maxwin gates of olympusrahasia rtp mahjong ways 2 di indojawa88maxwin mahjong ways 2 di indojawa88teknik gacor wild banditobagaimana fokus dan ketenangan bisa mengantar pada kemenangan tak terdugacara kuasai rtp tanpa perlu modal besar dan tetap unggultrik mudah menang di pg soft bikin banyak pemain suksesJUDI BOLA ONLINESABUNG AYAM ONLINELIVE CASINO ONLINEMAHJONG WAYS 2judi bola onlinesabung ayam onlinelive casino onlineslot mahjong waysjudi bola onlinesabung ayam onlinelive casino onlinezeus slot gacorlangkah tepat spin turbo mahjong ways 2 simak strategi jitu pahami pola scatter cuan besar modal recehtrik unik spin sweet bonanza kombinasi turbo x manual kasih cuan rp.98.250.000 hanya dengan modal gocapbocoran trik rahasia gates of olympus menang rp.120.335.100 dalam sehari pakai pola iniclaim 150 juta pertama joni spin mahjong wins 3 pakai trik ini scatter hitam pecah dimenit ke-3 hanya pakai modal 100 ributrik rata kanan ala sepuh mahjong ways cuan puluhan juta hanya andalkan rtp 88.90% simak sampai tuntasrungkat terus coba trik mahjong ways ini cukup depo sekali cuan selangit member baru welcome player pro silahkantrik cerdas mengungkap pola dan taktik kemenangan mahjong ways versi wahanabetkuasai taktik dan strategi pola dari wahanabet di mahjong ways dijamin ketagihan berkat maxwinpanduan lengkap dari wahanabet dengan tips dan pola untuk pemula di mahjong ways 2cuman 5 menit di mahjong ways 2 bisa ubah nasib berkat ikuti tips dan trik dari admin wahanabetrahasia dari admin wahanabet yang bikin lebih optimis bermain sweet bonanzaberodal 20 ribu auto kaget saat dapat perkalian di sweet bonanza berkat bocoran dari wahanabetSV388SBOBET88LIVE CASINO ONLINESCATTER HITAMSABUNG AYAM ONLINEMIX PARLAY SBOBETCASINO ONLINEZEUS SLOTSBOBET88Sabung Ayam OnlineSabung Ayam OnlineSabung Ayam OnlineSabung Ayam OnlineJudi Bola OnlineJudi Bola OnlineJudi Bola OnlineSabung Ayam OnlineSabung Ayam OnlineSabung Ayam Onlinejudi bolasabung ayam onlinemahjong wayssabung ayam onlinesabung ayam onlineSBOBET88SLOT777LIVE CASINO ONLINESABUNG AYAM ONLINEAGEN JUDI BOLASLOT QRISSBOBET88SBOBETLIVE CASINO ONLINESABUNG AYAM ONLINEMIX PARLAYSLOT MAHJONGSABUNG AYAM ONLINESABUNG AYAM ONLINEa>SBOBET88JUDI BOLASBOBET88SLOT GACORLIVE CASINO ONLINESABUNG AYAM ONLINEAGEN JUDI BOLASBOBET88SABUNG AYAM ONLINELIVE CASINO ONLINESLOT DANAlive casinosabung ayam onlinemix parlaysabung ayam onlinelive casinojudi bolasabung ayam onlinelive casinomix parlaySV388SBOBETCASINO ONLINEMAHJONG WAYS 2SV388SBOBET88CASINO ONLINESLOT MAHJONGSLOT MAHJONGLIVE CASINOSABUNG AYAMMIX PARLAYsitus live casinoagen live casinosabung ayam onlinesabung ayam onlineasianbet77sabung ayam onlineasianbet77asianbet77asianbet77SBOBETSV388LIVE CASINO ONLINESPACEMANJUDI BOLA ONLINESABUNG AYAM ONLINESABUNG AYAM ONLINEJUDI BOLA ONLINESITUS BANDAR BOLAJUDI BOLA ONLINESABUNG AYAM ONLINEJUDI BOLA ONLINESABUNG AYAM ONLINESLOT MPOSV388sabung ayam onlinejudi bola onlinelive casino onlineslot mahjong wayssabung ayam onlinejudi bola onlinelive casino onlineslot mahjong wayssabung ayam onlinejudi bola onlinelive casino onlineslot mahjong wayssabung ayam onlinejudi bola onlinelive casino onlineslot mahjong wayslive casino onlineslot mahjong gacorJudi BolaSabung Ayam onlinesabung ayam onlineJudi BolaLive Casino OnlineSabung Ayam onlineslot gacor mahjongSabung Ayam onlineslot gacor mahjongjudi bolaindobit88casino onlinesabung ayam onlineslot gacorjudi bolaslot mahjong gacorjudi bola onlineindobit88judi bolaindobit88Judi Bola OnlineSabung Ayam OnlineJudi Bola OnlineSabung Ayam OnlineJudi Bola Onlinecasino onlinemahjong waysJudi Bola OnlineCasino OnlineMahjong WaysMahjong Wayssabung ayam onlinesbobetcasino OnlineMahjong Wayssabung ayam onlinejudi bola onlinesv388sbobetmahjong ways 2mahjong wins 3gates of olympusstarlight princesssweet bonanzasbobetsv388pragmatic playsabung ayam onlinesbobet88judi bolasabung ayam onlinejudi bola onlinesabung ayam onlinemahjong ways 2mahjong wins 3gates of olympussweet bonanzastarlight princessmix parlaysabung ayam onlineagen baccaratslot gacorsitus slot onlinesabung ayam onlinejudi bola onlinecasino onlinemahjong ways 2judi bola onlinecasino onlineslot mahjongsabung ayam onlinejudi bola onlinemahjong ways 2SAbung Ayam OnlineJudi Bola OnlineSBOBET88SV388Slot Mahjongmahjong wins 3 disebut anti kalah oleh pemain indojawa88mahjong wins 3 menawarkan sensasi warna yang penuh cuan scatter hitam bertebaran
pola maxwin mahjong ways 2maxwin gates of gatot kacacara baca rtp mahjong ways 2jackpot scatter hitam mahjong winssabung ayam onlinesabung ayam onlinesabung ayam onlinejudi bola onlinesabung ayam onlinetrik rahasia mahjong ways 2 modal spin manual 200 perak scatter turun selayar bro auto cuan puluhan jutamain santai pakai pola ini sweet bonanza pecahkan bom x1000 scater warna warni kasih cuan gede brostrategi tak terduga spin mahjong wins 3 cuma modal depo 50k scatter hitam pecah joko dapat cuan besar claim wede rp.210.220.115 langsung cair ke rekeningpanen cuan pakai trik ini bocoran pola gates of olympus ala admin wahanabet bikin geger semua serverkupas tuntas kombinasi maut pola mahjong ways 3 viral cuan puluhan jutatrik ini bikin mahjong ways jadi viral bro vina nekat spin turbo raup cuan puluhan juta dalam semalamSV388SBOBET88CASINO ONLINEZEUS SLOTSABUNG AYAM ONLINEMIX PARLAY SBOBETLIVE CASINO ONLINESCATTER HITAMsabung ayam onlinesabung ayam onlinesabung ayam onlinesabung ayam onlineMix parlaySabung Ayam OnlineSabung Ayam OnlineSabung Ayam OnlineSabung Ayam OnlineSabung Ayam OnlineSabung Ayam Onlineいきがい活動ステーション Accesscara pemain cerdas menang stabil di mahjong wayscara pemain mahjong ways 3 dapat scatter tanpa ribetpola ampuh pahami trik kuasai rtp agar menang
Nusa Islands Bali Official PackagesTrinidad and Tobago Pilots’ Association Official About Pagemaxwin mahjong wins 3strategi main gates of olympuskuasai pola rtp pragmatic playlangkah mendapatkan scatter emaspola rtp pg soft indojawa88Green Gold Mountain Official SiteKomite SMKN 1 Tanjung Jabung Barat Official Sitetutorial maxwin mahjong waysstrategi rtp mahjong waysEIKON Official Policieskontak situs pecinta ayamNusa Islands Bali Official ContactCitraLand Surabaya Official NewsLenterakita About PageVinayak Group Official SiteI Think An Idea Official SitePITAC Official SitePortfolioSitez Official SiteMedical LTD Official SiteCapworks Official SiteMartino & Luth Official SiteTech With Mike First Official SiteSahabat Tiopan Official SiteE-Sekolah CBT Official SiteBDF Ventura Official SiteOcean E Soft Official SiteArab DMC Official SiteBBC Noun Official SiteCang Vu Hai Phong Official SiteThe Flat Official SiteThe Black Sheep Official SiteCEM Argentina Official SiteSlot MahjongTop Dawg Tavern Official SiteKelas Nesfatin Official SiteDuhoc Interlink Official SiteKarunia Inda Med Mandiri Official SiteJFV Pulm Official SiteRatiohead Official SiteAskona Official SiteMAN Surabaya E-Learning Official SiteShaker Group Official SiteTakaKawa Shoten Official SiteBrydan Solutions Official SiteConcursos Rodin Official SiteEHOB Official SiteConmou Official SiteCareer Wings Official SiteMontero Espinosa Official SiteBDF Ventura Official SiteDesa Sangginora Official SiteBDF Ventura Official SiteTaruna Akademia Official SiteAkura Official SiteMUI Ciamis Official SiteNamulanda Technical Institute Official Site