Nous Research launches toggle-on reasoning AI DeepHermes-3

Share This Post

[ad_1]

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More


AI reasoning models — those that produce “chains-of-thought” in text and reflect on their own analysis to try and catch errors midstream before outputting a response to a user — are all the rage now thanks to the likes of DeepSeek and OpenAI’s “o” series.

Still, it’s pretty incredible to me the speed at which the reasoning model approach has spread across the AI industry, with this week’s announcement that there’s yet another new model to try, this one from the mysterious yet laudably principled Nous Research collective of engineers, whose entire mission since launching in New York City in 2023 has been to make “personalized, unrestricted” AI models — often by taking and fine-tuning or retraining open source models such as Meta’s Llama series and those from French startup Mistral.

As posted on the Nous Research account on X and in the firm’s Discord channel, this new open reasoning model is called “DeepHermes-3 Preview,” and is described as an “LLM [large language model] that unifies reasoning and intuitive language model capabilities,” with the capability for the user to switch at will between longer reasoning processes and shorter, faster, less computationally demanding responses.

It’s an 8-billion parameter (settings count) variant of Hermes 3, itself a variant of Meta’s Llama released by Nous back in August 2024 with sample exchanges showing that it could enter into metacognition-like displays of thinking about itself and the role of AI compared to human consciousness, trigging something approaching an existential crisis in the model’s outputs.

Users can download the full model code on HuggingFace and a version that’s been quantized (reduced bit count) and saved in the GPT-Generated Unified Format (GGUF), which is designed to run model inferences (the actual production build, as opposed to training) on consumer-grade PCs and servers.

The Nous account today wrote that its researchers “hope our unique approach to user controlled, toggleable reasoning mode furthers our mission of giving those who use DeepHermes more steerability for whatever need they have.”

Building on Hermes 3: The Data and Training Approach

DeepHermes-3 builds upon the Hermes 3 dataset, a meticulously curated multi-domain dataset that Nous Research developed for the broader Hermes 3 series.

According to the Hermes 3 Technical Report released back in August, this dataset is composed of approximately 390 million tokens spanning diverse instructional and reasoning-based domains.

The dataset is broken down into the following key categories:

General Instructions (60.6%) – Broad, open-ended prompts similar to those found in general-purpose AI chat models.

Domain Expert Data (12.8%) – Specialized knowledge in fields like science, law, and engineering.

Mathematics (6.7%) – Advanced problem-solving datasets aimed at improving numerical and logical reasoning.

Roleplaying and Creative Writing (6.1%) – Data designed to enhance storytelling and simulated dialogue.

Coding and Software Development (4.5%) – Code generation and debugging tasks.

Tool Use, Agentic Reasoning, and Retrieval-Augmented Generation (RAG) (4.3%) – Training on function calling, planning, and knowledge retrieval.

Content Generation (3.0%) – Writing, summarization, and structured output tasks.

Steering and Alignment (2.5%) – Data focused on making the model highly steerable and responsive to user prompts.

In addition, the pseudonymous Nous Research team member @Teknium (@Teknium1 on X) wrote in response to a user of the company’s Discord server that the model was trained on “1m non cots and 150k cots,” or, 1 million non-chain-of-thought outputs and 150,000 chain-of-thought outputs.

This data mixture supports DeepHermes-3’s unique ability to toggle between intuitive responses and deep, structured reasoning, a key feature that distinguishes it from other LLMs.

How Toggleable Reasoning Mode Works

DeepHermes-3 allows users to control its reasoning depth using a system prompt. The user needs to enter the following text before a prompt to “toggle on” the model’s reasoning mode:

You are a deep thinking AI, you may use extremely long chains of thought to deeply consider the problem and deliberate with yourself via systematic reasoning processes to help come to a correct solution prior to answering. You should enclose your thoughts and internal monologue inside tags, and then provide your solution or response to the problem.

When reasoning mode is enabled, the model processes information in long chains of thought, allowing it to deliberate systematically before generating an answer.

This is achieved using the <think></think> tags, where the model’s internal monologue is structured before presenting a final solution.

In standard response mode, the model operates more like a traditional AI chatbot, providing quicker, intuition-based responses without deep logical processing.

Performance Insights and Community Feedback

Early benchmarking and community testing have provided key insights into DeepHermes-3’s capabilities:

Mathematical Reasoning: DeepHermes-3 scores 67% on MATH benchmarks, compared to 89.1% for DeepSeek’s R1-distilled model. While DeepSeek outperforms it in pure math tasks, Nous Research positions DeepHermes-3 as a more generalist model with broader conversational and reasoning skills.

Multi-Turn Conversations: Some testers report that reasoning mode activates correctly on the first response but may fail to persist in extended conversations. Community members suggest enforcing <think>\n at the start of each response, a method also used in DeepSeek-R1.

Function Calling: DeepHermes-3 supports tool use, though it was not explicitly trained to integrate reasoning mode and function calling simultaneously. Some users report that while combining both features improves accuracy in executing tools, results remain inconsistent.

Nous Research is actively gathering user feedback to refine reasoning persistence and improve multi-turn interactions.

Deployment and Hardware Performance

DeepHermes-3 is available for testing on Hugging Face, with GGUF quantized versions optimized for low-power hardware. The model is compatible with vLLM for inference and uses Llama-Chat format for multi-turn dialogue.

One user reported a processing speed of 28.98 tokens per second on a MacBook Pro M4 Max, demonstrating that the model can run efficiently on consumer hardware.

DeepHermes-3 is based on Meta’s Llama 3 model and is governed by the Meta Llama 3 Community License. While the model is freely available for use, modification, and redistribution, certain conditions apply:

Redistribution: Any derivative models or deployments must include the original license and prominently display “Built with Meta Llama 3.”

Restrictions on Model Training: Users cannot use DeepHermes-3 (or Llama 3) to train other large language models, except for derivative works explicitly based on Llama 3.

• Commercial Licensing for Large Companies: Organizations with over 700 million monthly active users must obtain explicit approval from Meta before using the model commercially.

• Acceptable Use Policy: Users must comply with Meta’s AI usage restrictions, which prohibit applications in areas like misinformation, surveillance, and harmful content generation.

These redistribution rules and commercial limitations mean that DeepHermes-3 is not fully open-source in the traditional sense, despite its availability on Hugging Face, unlike Chinese rival DeepSeek’s hit R1 reasoning model, which is available under a permissive MIT License.

Looking ahead to Hermes 4

DeepHermes-3 was developed by @teknium, @emozilla, @Gifted Gummy Bee, @hjc-puro, and @jsupha, with Nous Research crediting the open-source community for contributions to datasets, evaluation tools, and model training.

Nous Research sees this preview model as a stepping stone toward the next major release, Hermes 4, which is expected to further refine its reasoning and conversational abilities.


[ad_2]
Source link

Related Posts

Online Gaming Platform Shutdown Scams: A Warning Report

The world of online gaming is filled with exciting...

Dive Into New Challenges and Win Big

Embrace the Excitement of Overcoming Challenges and Achieving Great...

Portal Breakers Enter the Fractured Universe

The universe is far larger and stranger than most...

Adios, Windows: These alternatives make switching from Microsoft easy

If you can’t install Windows 11 on your...
- Advertisement -spot_img
Slot Gacor Slot777slot mahjongslot mahjongjudi bola onlinesabung ayam onlinejudi bola onlinelive casino onlineslot danaslot thailandsabung ayam onlinejudi bola onlinesitus live casino onlineslot mahjong waysbandar togel onlinejudi bolasabung ayam onlinejudi bolaSABUNG AYAM ONLINESABUNG AYAM ONLINEJUDI BOLA ONLINESABUNG AYAM ONLINEjudi bola onlineslot mahjong wayslive casino onlinejudi bola onlinejudi bola onlinesabung ayam onlinejudi bola onlinemahjong wayssabung ayam onlinesbobet88slot mahjongsabung ayam onlinesbobet mix parlayslot777judi bola onlinesabung ayam onlinesabung ayam onlinejudi bola onlinelive casino onlineslot mahjong waysjuara303juara303juara303juara303juara303juara303juara303juara303SV388Mix ParlayBLACKJACKSLOT777Sabung Ayam OnlineBandar Judi BolaAgen Sicbo Online
agen sabung ayamslot mahjong gacorsabung ayam onlinejudi bola onlinelive casino onlineslot mahjongsabung ayam onlinejudi bola onlinelive casino onlineslot mahjongslot mahjongsabung ayam onlinescatter hitamlive casino onlinemix parlaysabung ayam onlinelive casinomahjong waysmix parlaysabung ayam onlinelive casinomahjong waysmix parlaySBOBETSBOBETCASINO ONLINESBOBETSBOBET88SABUNG AYAM ONLINESBOBETagen judi bolalive casino onlinesabung ayam onlinejudi bola sbobetsabung ayam onlineSabung Ayam OnlineJudi Bola OnlineAgen Live Casino OnlineMahjong Ways 2Sabung Ayam OnlineJudi Bola OnlineAgen Live Casino OnlineMahjong Ways 2Sabung Ayam OnlineJudi Bola OnlineAgen Live Casino OnlineMahjong Ways 2slot gacorjudi bolamix parlayjudi bolasv388SABUNG AYAM ONLINELIVE CASINO ONLINEJUDI BOLAMAHJONG WAYSSLOT MAHJONGJUDI BOLA ONLINELIVE CASINO ONLINESABUNG AYAM ONLINE
SABUNG AYAM ONLINESABUNG AYAM ONLINEJUDI BOLA ONLINEJUDI BOLA ONLINESABUNG AYAM ONLINESABUNG AYAM ONLINESABUNG AYAM ONLINESABUNG AYAM ONLINEjudi bola onlinesabung ayam onlinelive casino onlinesitus toto 4djudi bola onlinejudi bola onlinesabung ayam onlinelive casino onlinejudi bola onlinemix parlaysbobet88sv388sbobet mix parlayws168sbobet88sv388sv388sbobet88sabung ayam onlinejudi bola onlinesabung ayam onlinesbobet mix parlaysabung ayam onlinejudi bola onlineslot gacorsabung ayam onlinejudi bola onlinelive casino onlineslot mahjong waysjuara303juara303juara303juara303juara303juara303juara303juara303juara303juara303juara303juara303juara303juara303juara303juara303SV388Mix ParlayLive Casino OnlineSitus Slot GacorSV388SBOBET WAPBlackjackPragmatic PlaySV388Judi Bola OnlineBlackjackKakek ZeusSV388Mix ParlayAgen BlackjackSlot Gacor Onlinesabung ayam onlinejudi bola onlinesabung ayam onlinejudi bola onlinejudi bola onlinejudi bola onlinejudi bola onlinesabung ayam onlinejudi bola onlineslot mahjong wayssabung ayam onlinejudi bolaslot mahjonglive casino onlinesabung ayam onlinejudi bola onlineslot mahjong gacorsitus toto togel 4Dsabung ayam onlinesitus toto togel 4Dsitus live casinojudi bola onlinesitus slot mahjongjudi bolasabung ayam onlinesabung ayam onlinemahjong wayssabung ayam onlinejudi bolasabung ayam onlinejudi bola
judi bola onlinejudi bola onlinejudi bola onlinejudi bola onlineJUDI BOLA ONLINESBOBET88JUDI BOLA ONLINEJUDI BOLA ONLINESV388Judi Bola OnlineBlackjackKakek ZeusSV388SBOBET WAPAgen BlackjackSlot Gacor Onlinejuara303juara303juara303juara303juara303juara303juara303juara303judi bola onlinejudi bola onlinejudi bola onlinesabung ayam onlinejudi bolasabung ayam onlinesabung ayam onlinejudi bola onlinesitus live casino onlineslot mahjong wayssabung ayam onlinesitus live casinojudi bola onlinedexel
Slot Mahjong Waysslot danaslot danaslot danasabung ayam onlinesabung ayam onlineJUDI BOLA ONLINESV388Mix ParlayAgen Casino OnlineSLOT777Sabung Ayam OnlineAgen Judi BolaLive Casino Onlinesabung ayam onlinesabung ayam onlinejudi bola onlineslot mahjong wayssabung ayam onlinejudi bola onlinesitus live casino onlineagen togel onlineSabung Ayam OnlineJudi Bola OnlineSlot MahjongBandar togelSabung Ayam OnlineJudi Bola Onlinejudi bola onlinejudi bola onlinesabung ayam onlinelive casino onlineJUDI BOLA ONLINESBOBET88JUDI BOLA ONLINEmix parlaymix parlaylive casinosabung ayam onlinemix parlayslot danaslot mahjongslot mahjongjudi bolaMAHJONG WAYS 2SABUNG AYAM ONLINELIVE CASINO ONLINESABUNG AYAM ONLINESBOBETLIVE CASINO ONLINESLOT MAHJONG WAYSSABUNG AYAM ONLINEMIX PARLAYSABUNG AYAM ONLINESABUNG AYAM ONLINEWALA MERONWALA MERONSITUS SABUNG AYAMSITUS SABUNG AYAMjudi bola terpercayaSabung Ayam Onlinemix parlaySabung Ayam OnlineZeus Slot GacorSitus Judi BolaSabung Ayam Onlinesitus sabung ayamSlot MahjongSV388SBOBET88live casino onlineslot mahjong gacorSV388SBOBET88live casino onlineslot mahjong gacorSabung Ayam OnlineJudi Bola OnlineCasino OnlineMahjong Ways 2Sabung Ayam OnlineJudi Bola OnlineLive Casino OnlineMahjong Ways 2judi bolacasino onlinesv388sabung ayam onlinejudi bola onlineagen live casino onlinemahjong waysLIVE CASINOJUDI BOLA ONLINESABUNG AYAM ONLINESITUS BOLASV388LIVE CASINO ONLINESLOT QRISSABUNG AYAM ONLINEMIX PARLAYMIX PARLAYJUDI BOLA ONLINESLOT MAHJONG
Mahjong Ways 2mahjong ways 2indojawa88daftar dan login wahanabetCapWorks Official ContactAynsley Official SitedexelHarifuku Clinic Official AccessNusa Islands Bali Official PackagesTrinidad and Tobago Pilots’ Association Official About PageNusa Islands Bali Official ContactCapworks Official SiteTech With Mike First Official SiteSahabat Tiopan Official SiteOcean E Soft Official SiteCang Vu Hai Phong Official SiteThe Flat Official SiteTop Dawg Tavern Official SiteDuhoc Interlink Official SiteRatiohead Official SiteMAN Surabaya E-Learning Official SiteShaker Group Official SiteTakaKawa Shoten Official SiteBrydan Solutions Official SiteConcursos Rodin Official SiteConmou Official SiteCareer Wings Official SiteMontero Espinosa Official SiteBDF Ventura Official SiteAkura Official SiteNamulanda Technical Institute Official Sitemenu home roasted coffeetosayama academy workshopjudi bola onlineContactez le Monaco Rugby Sevens - Club Professionnel à 7Virtual Eco Museum Official Event 2025DRT Seitai Official Contacta leading company in UWB technology development