Deep Cogito v2 open source models have self-improving intuition

Share This Post


Want smarter insights in your inbox? Sign up for our weekly newsletters to get only what matters to enterprise AI, data, and security leaders. Subscribe Now


Deep Cogito, a lesser-known AI research startup based in San Francisco founded by ex-Googlers, has released four new open-ish large language models (LLMs) that attempt something few others do: Learning how to reason more effectively over time — and get better at it on their own.

The models, released as part of Cogito’s v2 family, range from 70 billion to 671 billion parameters and are available for AI developers and enterprises to use under a mix of limited and fully open licensing terms. They include:

  • Cogito v2-70B (Dense)
  • Cogito v2-109B (Mixture-of-experts)
  • Cogito v2-405B (Dense)
  • Cogito v2-671B (MoE)

Dense and MoE models are each suited to different needs. Dense 70B and 405B variant models activate all parameters on every forward pass, making them more predictable and easier to deploy across a wide range of hardware.

They’re ideal for low-latency applications, fine-tuning and environments with limited GPU capacity. MoE models, such as the 109B and 671B versions, use a sparse routing mechanism to activate only a few specialized “expert” subnetworks at a time, allowing for much larger total model sizes without proportional increases in compute cost.


The AI Impact Series Returns to San Francisco – August 5

The next phase of AI is here – are you ready? Join leaders from Block, GSK, and SAP for an exclusive look at how autonomous agents are reshaping enterprise workflows – from real-time decision-making to end-to-end automation.

Secure your spot now – space is limited: https://bit.ly/3GuuPLF


This makes them well-suited for high-performance inference tasks, research into complex reasoning or serving frontier-level accuracy at lower runtime expense. In Cogito v2, the 671B MoE model serves as the flagship, leveraging its scale and routing efficiency to match or exceed leading open models on benchmarks — while using significantly shorter reasoning chains.

The models are available now on Hugging Face for download and usage by enterprises and on Unsloth for local usage, or, for those who can’t host the model inferences on their own hardware, through application programming interfaces (APIs) from Together AI, Baseten and RunPod.

There’s also a quantized “8-bit floating point (FP8)” version of the 671B model, which reduces the size of the numbers used to represent the model’s parameters from 16-bits to 8-bits, helping users run massive models faster, cheaper and on more accessible hardware — sometimes with only a negligible hit to performance (95 to 99%). However, this can slightly degrade model accuracy, especially for tasks requiring fine-grained precision (some math or reasoning problems).

All four Cogito v2 models are designed as hybrid reasoning systems: They can respond immediately to a query, or, when needed, reflect internally before answering.

Crucially, that reflection is not just runtime behavior — it’s baked into the training process itself.

These models are trained to internalize their own reasoning. That means the very paths they take to arrive at answers — the mental steps, so to speak — are distilled back into the models’ weights.

Over time, they learn which lines of thinking actually matter and which don’t.

As Deep Cogito’s blog post notes, the researchers “disincentivize the model from ‘meandering more’ to be able to arrive at the answer, and instead develop a stronger intuition for the right search trajectory for the reasoning process.”

The result, Deep Cogito claims, is faster, more efficient reasoning and a general improvement in performance, even in so-called “standard” mode.

Self-improving AI

While many in the AI community are just encountering the company, Deep Cogito has been quietly building for over a year.

It emerged from stealth in April 2025 with a series of open-source models trained on Meta’s Llama 3.2. Those early releases showed promising results.

As VentureBeat previously reported, the smallest Cogito v1 models (3B and 8B) outperformed Llama 3 counterparts across several benchmarks — sometimes by wide margins.

Deep Cogito CEO and co-founder Drishan Arora — previously a lead LLM engineer at Google — described the company’s long-term goal as building models that can reason and improve with each iteration, much like how AlphaGo refined its strategy through self-play.

Deep Cogito’s core method, iterated distillation and amplification (IDA), replaces hand-written prompts or static teachers with the model’s own evolving insights.

What is ‘machine intuition’?

With Cogito v2, the team took that loop to a much larger scale. The central idea is simple: Reasoning shouldn’t just be an inference-time tool; it should be part of the model’s core intelligence.

So, the company implemented a system where the model runs reasoning chains during training, and then is trained on its intermediate thoughts.

This process yields concrete improvements, according to internal benchmarks. The flagship 671B MoE model outperforms DeepSeek R1 in reasoning tasks, matching or beating its latest 0528 model while using 60% shorter reasoning chains.

On MMLU, GSM8K and MGSM, Cogito 671B MoE’s performance was roughly on par with top open models like Qwen1.5-72B and DeepSeek v3, and approached the performance tier of closed models like Claude 4 Opus and o3.

Specifically:

  • Cogito 671B MoE (reasoning mode) matched DeepSeek R1 0528 across multilingual QA and general knowledge tasks, and outperformed it on strategy and logical deduction.
  • In non-reasoning mode, it exceeded DeepSeek v3 0324, suggesting that the distilled intuition carried real performance weight even without an extended reasoning path.
  • The model’s ability to complete reasoning in fewer steps also had downstream effects: Lower inference costs and faster response times on complex prompts.

Arora explains this as a difference between searching for a path versus already knowing roughly where the destination lies.

“Since the Cogito models develop a better intuition of the trajectory to take while searching at inference time, they have 60% shorter reasoning chains than Deepseek R1,” he wrote in a thread on X.

What kinds of tasks do Deep Cogito’s new models excel at when using their machine intuition?

Some of the most compelling examples from Cogito v2’s internal testing highlight exactly how this manifests in use.

In one math-heavy prompt, a user asks whether a train traveling at 80 mph can reach a city 240 miles away in under 2.5 hours.

While many models simulate the calculation step-by-step and occasionally make unit conversion errors, Cogito 671B reflects internally, determines that 240 ÷ 80 = 3 hours, and correctly concludes that the train cannot arrive in time. It does so with only a short internal reasoning trace — under 100 tokens — compared to the 200-plus used by DeepSeek R1 to reach the same answer.

In another example involving legal reasoning, a user asks whether a specific U.S. Supreme Court ruling would apply to a hypothetical case involving search and seizure. Cogito’s reasoning mode highlights a two-step logic: Dirst determining whether the hypothetical matches the precedent, then explaining why it does or doesn’t. The model reaches a nuanced answer with clear justification — a kind of interpretive reasoning that many LLMs still struggle with.

Other tasks show improvements in handling ambiguity. On a classic multi-hop question — “If Alice is Bob’s mother, and Bob is Charlie’s father, what is Alice to Charlie?” — models often get tangled in pronouns. Cogito v2’s models correctly identify Alice as Charlie’s grandmother, even in slightly reworded variants where other open models falter.

Efficiency at scale

Despite the massive size of the new models, Deep Cogito claims to have trained all eight of its Cogito models — including smaller v1 checkpoints — for under $3.5 million in total, compared to the reported $100 million plus for some of OpenAI’s leading models.

That includes data generation, synthetic reinforcement, infrastructure and more than 1,000 training experiments. Compared to the nine-figure budgets of other frontier models, it’s a fraction of the typical spend.

Arora attributes this frugality to the company’s core thesis: Smarter models need better priors, not more tokens.

By teaching the model to skip redundant or misleading reasoning paths, Cogito v2 delivers stronger performance without ballooning inference time.

That’s a meaningful tradeoff for users running models on API infrastructure or edge devices where latency and cost matter.

What’s next for Deep Cogito and v2?

The release of Cogito v2 is not a final product, but an iterative step. Arora describes the company’s roadmap as “hill climbing” — running models, learning from their reasoning traces, distilling them and repeating the loop. Over time, each model becomes a stepping stone for the next.

Every model Deep Cogito has released is open source, and the company says that will remain true for future iterations.

Already, its work has attracted attention and support from backers like Benchmark’s Eric Vishria and South Park Commons’ Aditya Agarwal.

Infrastructure partners include Hugging Face, Together AI, RunPod, Baseten, Meta’s Llama team and Unsloth.

For developers, researchers, and enterprise teams, the models are available now. Developers can run them locally, compare modes or fine-tune for specific use cases.

And, for the broader open-source AI community, Cogito v2 offers more than just a new benchmark winner — it proposes a different way to build intelligence. Not by thinking harder, but by learning how to think better.



Source link

Related Posts

Voyager Technologies Acquires ExoTerra Resources

SAN FRANCISCO — Voyager Technologies announced the acquisition...

AMD is rebadging 2022 Ryzen processors as ‘new’ chips

If you’re in the market for a new...

Access Denied

Access Denied You don't have permission to access...

Apple may bring vapor chamber cooling to future iPad Pro models

Apple could be gearing up for a major...
- Advertisement -spot_img
cara pemain cerdas menang stabil di mahjong wayscara pemain mahjong ways 3 dapat scatter tanpa ribetpola ampuh pahami trik kuasai rtp agar menang
SV388SV388JUDI BOLA ONLINESBOBET88sabung ayam onlinejudi bola onlinelive casino onlinejudi bola onlinesabung ayam onlinelive casino onlineungkap pola misterius mahjong ways jarwo dikasih menang rp.221.330.110 cuma spin manual x10trik nekat modal 55 ribu login mahjong ways 2 stella menang rp.110.500.300 saldo langsung cair via danawahanabet ungkap trik dapat cuan besar disemua game online hingga bocoran pola & rtp tinggistrategi ampuh candy pops sweet bonanza kombinasi pola & rtp 98.21% trik ledakan bom x1000 auto cuanhokimu tiba hari ini budi main wild bandito hanya modal 100 ribu abaikan rtp cukup pakai trik ini 15x putaran langsung wedepola klasik gates of olympus trik jitu yang satu ini gak ada matinya depo 45 ribu masih worth it brosabung ayam onlinesabung ayam onlinesabung ayam onlineSBOBET88sabung ayam onlineindopromaxindopromaxindopromaxindopromaxindopromaxindopromaxindopromaxindopromaxSabung Ayam OnlineSabung Ayam OnlineSabung Ayam Onlinejudi bola onlinejudi bolajudi bolasabung ayam onlinesabung ayam onlinesabung ayam onlinelive casino online sabung ayam slot mahjong judi bola SV388jUDI BOLASBOBET88SBOBET88WS168LIVE CASINO ONLINESBOBET88SV388SEXYGAMINGINDOBALI88SABA SPORTSV388LIVE CASINOSV388Mahjong WaysSABUNG AYAM ONLINELive Casino OnlineSabung Ayam onlinemahjong ways 2sabung ayam onlinejudi bola onlinelive casino onlineslot gacor mahjongslot gacor mahjongslot gacor mahjongslot gacor mahjongLIVE CASINO ONLINESBOBETSABUNG AYAM ONLINESABUNG AYAM ONLINECASINO ONLINECASINO ONLINELIVE CASINO ONLINEJUDI BOLALIVE CASINO ONLINEMAHJONGSABUNG AYAM ONLINESITUS JUDI BOLASABUNG AYAM ONLINELIVE CASINO ONLINESLOT MAHJONGlive casinomix parlaymix parlaysabung ayam onlinelive casinomix parlaysabung ayam onlinesabung ayam onlinemix parlaysabung ayam onlinemix parlaysabung ayam onlinemix parlayparlaysitus live casinojudi bolaSabung Ayam OnlineSabung Ayam OnlineSabung Ayam OnlineSABUNG AYAMJUDI BOLALIVE CASINOSLOT MAHJONGMAHJONG WAYSJUDI BOLA ONLINESABUNG AYAM ONLINESWEET BONANZASLOT ZEUSSV388JUDI BOLA ONLINEJUDI BOLA ONLINEJUDI BOLA ONLINESABUNG AYAM ONLINELIVE CASINO ONLINESITUS SLOT ONLINEPRAGMATIC PLAYMAHJONG WAYSJudi BolaLive Casino OnlineSabung Ayam onlinemahjong ways 2Judi Bolamahjong ways 2mahjong wins 3gates of olympussweet bonanzastarlight princesssbobetsv388agen baccaratsabung ayam onlinejudi bola onlinejudi bola onlinesabung ayam onlinemahjong ways 2mahjong wins 3lucky nekosweet bonanzastarlight princessjudi bola onlinesabung ayam onlineagen casino onlinecasino onlinejudi bola onlinesabung ayam onlinecasino onlinejudi bola onlinejudi bola onlinesabung ayam onlinecasino onlineslot gacor mahjonglive casino onlinesabung ayam onlinesabung ayam onlinecasino onlineslot gacor mahjongsabung ayam onlinejudi bola onlinejudi bolasabung ayam onlineindobit88live casino onlinesv388sabung ayam onlineCasino Onlinesabung ayam onlineMix Parlaycasino onlineMahjong Wayssabung ayam onlinejudi bola onlinesabung ayam onlinemix parlay
analisis pola spin mahjong ways untuk menang konsistenwild mahjong ways 3 ajarkan cara hadapi tantangan hidupputaran maxwin mahjong ways 2mengenal pola sukses mahjong wins 3strategi maxwin pemain mahjong wayspola campuran gate of olympusmenang pragmatic play pakai pola indojawa88strategi ammar menang mahjong ways 2bangkit dari kerugian berkat mahjong wins 3strategi Jackpot main clover goldJUDI BOLA ONLINESABUNG AYAM ONLINELIVE CASINO ONLINESLOT MAHJONGMAHJONG WAYS 2judi bola onlinejudi bola onlineslot mahjong wayssv388sbobet88live baccarat onlinesbobet mix parlaycara spin mahjong ways ala admin wahanabet tips mantul cuan tipis tapi lancar tanpa hambatanmenang rp.142.250.331 pakai kombinasi duo maut denny spill trik dapat cuan dari treasures of aztec modal bet cuma 400 perakSabung Ayam Onlinejudi bola onlinemahjong wayssabung ayam onlinesabung ayam onlinesabung ayam onlinesabung ayam onlinejudi bolaslot mahjongsv388judi bolasabung ayam onlineungkap rahasia scatter mahjong wayskiat sukses pemain gates of olympustrik dan strategi kuasai rtp mahjong winspola lonceng emas queen of bountystrategi scatter hitam mahjong ways 2maxwin scatter hitam indojawa88
slot mahjong wayssabung ayam onlinejudi bola onlinesabung ayam onlinelive casino onlinejudi bola onlinesabung ayam onlinelive casino onlineSABUNG AYAM ONLINESBOBET88LIVE CASINO ONLINEMAHJONG WAYS 2JUDI BOLA ONLINESBOBET88SBOBETsv388sbobet88ws168sbobet mobilemahjong waysmodal nekat cuan dahsyat pakai pola ini spin sweet bonanza bisa menang puluhan jutajangan salah langkah main mahjong ways pakai trik ini scatter hitam pasti pecah terusSabung Ayam OnlineSabung Ayam OnlineSabung Ayam OnlineSabung Ayam OnlineSabung Ayam OnlineLAB Official Work PageGalleria Pallesi Official SitePITAC Official ContactSOBER ICT Official Contactsabung ayam onlinesabung ayam onlinesabung ayam onlinesabung ayam onlinesabung ayam onlinesabung ayam onlineSBOBETLIVE CASINO ONLINESBOBET88LIVE CASINO ONLINEJUDI BOLASABUNG AYAM ONLINESLOT MAHJONGLIVE CASINO ONLINESABUNG AYAM ONLINEMIX PARLAYCASINO ONLINESLOT MAHJONGSABUNG AYAM ONLINESBOBET88SABUNG AYAM ONLINELIVE CASINO ONLINELIVE CASINO ONLINESABUNG AYAM ONLINESABUNG AYAM ONLINELIVE CASINO ONLINELIVE CASINO ONLINEAGEN JUDI BOLACASINO ONLINESLOT MAHJONGsabung ayam onlinemix parlaymix parlaymix parlaysabung ayam onlinemix parlaymix parlaysabung ayam onlinelive casinoSV388SBOBET88CASINO ONLINEPG SOFTSLOT GACORJUDI BOLA ONLINESITUS JUDI BOLASITUS JUDI BOLASABUNG AYAM ONLINEJUDI BOLA ONLINECASINO ONLINESLOT MAXWINSLOT GACORSBOBETSLOT MAHJONG WAYSSLOT KAKEK ZEUSSLOT SPACEMANBANDAR BOLAJUDI BOLASABUNG AYAMMEGA WHEELSLOT 4DSV388MAHJONG WAYSsabung ayam onlinejudi bola onlinelive casino onlineslot mahjonglive casino onlineslot mahjongsabung ayam onlinejudi bola onlinelive casino onlineslot mahjongJudi BolaLive Casino OnlineSabung Ayam onlineSlot Mahjong Gacorjudi bolaslot mahjonglive casinoindobit88slot onlineagen judi bolasabung ayam onlinejudi bolatogel onlinesbobet88sbobet88Mix Parlaycasino onlineMix ParlaySV388Judi Bola OnlineMahjong WaysSabung Ayam Onlinesabung ayam onlinejudi bola onlinemahjong ways 2slot mahjong waysmahjong ways 2mahjong wins 3lucky nekosweet bonanzastarlight princesssbobetsabung ayam onlineagen casinosabung ayam onlinejudi bola onlinesabung ayam onlinejudi bola onlinemahjong ways 2mahjong wins 3starlight princesssweet bonanzagates of olympussabung ayam onlinesbobetagen casinoSLOT ZEUSSABUNG AYAM ONLINESABUNG AYAM ONLINELIVE CASINOSLOT MAHJONGcasino onlineslot zeusjudi bola onlinesabung ayam onlinesabung ayam onlinecasino onlineMIX PARLAYSV388INDOBALI88SABUNG AYAM ONLINESBOBET88WS168CASINO ONLINESBOBET88MIX PARLAYJUDI BOLAMAHJONG WAYS 2MAHJONG WAYSMAHJONG WINS 3POLA MAHJONG WAYSSITUS MAHJONG WAYS 2
Nusa Islands Bali Official PackagesTrinidad and Tobago Pilots’ Association Official About Pagemaxwin mahjong wins 3strategi main gates of olympuskuasai pola rtp pragmatic playlangkah mendapatkan scatter emaspola rtp pg soft indojawa88Green Gold Mountain Official SiteKomite SMKN 1 Tanjung Jabung Barat Official Sitetutorial maxwin mahjong waysstrategi rtp mahjong waysEIKON Official Policieskontak situs pecinta ayamNusa Islands Bali Official ContactCitraLand Surabaya Official NewsLenterakita About PageVinayak Group Official SiteI Think An Idea Official SitePITAC Official SitePortfolioSitez Official SiteMedical LTD Official SiteCapworks Official SiteMartino & Luth Official SiteTech With Mike First Official SiteSahabat Tiopan Official SiteE-Sekolah CBT Official SiteBDF Ventura Official SiteOcean E Soft Official SiteArab DMC Official SiteBBC Noun Official SiteCang Vu Hai Phong Official SiteThe Flat Official SiteThe Black Sheep Official SiteCEM Argentina Official SiteSlot MahjongTop Dawg Tavern Official SiteKelas Nesfatin Official SiteDuhoc Interlink Official SiteKarunia Inda Med Mandiri Official SiteJFV Pulm Official SiteRatiohead Official SiteAskona Official SiteMAN Surabaya E-Learning Official SiteShaker Group Official SiteTakaKawa Shoten Official SiteBrydan Solutions Official SiteConcursos Rodin Official SiteEHOB Official SiteConmou Official SiteCareer Wings Official SiteMontero Espinosa Official SiteBDF Ventura Official SiteDesa Sangginora Official SiteBDF Ventura Official SiteTaruna Akademia Official SiteAkura Official SiteMUI Ciamis Official SiteNamulanda Technical Institute Official Site