Does RAG make LLMs less safe?  Bloomberg research reveals hidden dangers

Share This Post

[ad_1]

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More


Retrieval Augmented Generation (RAG) is supposed to help improve the accuracy of enterprise AI by providing grounded content. While that is often the case, there is also an unintended side effect.

According to surprising new research published today by Bloomberg, RAG can potentially make large language models (LLMs) unsafe. 

Bloomberg’s paper, ‘RAG LLMs are Not Safer: A Safety Analysis of Retrieval-Augmented Generation for Large Language Models,’ evaluated 11 popular LLMs including Claude-3.5-Sonnet, Llama-3-8B and GPT-4o. The findings contradict conventional wisdom that RAG inherently makes AI systems safer. The Bloomberg research team discovered that when using RAG, models that typically refuse harmful queries in standard settings often produce unsafe responses.

Alongside the RAG research, Bloomberg released a second paper, ‘Understanding and Mitigating Risks of Generative AI in Financial Services,’ that introduces a specialized AI content risk taxonomy for financial services that addresses domain-specific concerns not covered by general-purpose safety approaches.

The research challenges widespread assumptions that retrieval-augmented generation (RAG) enhances AI safety, while demonstrating how existing guardrail systems fail to address domain-specific risks in financial services applications.

“Systems need to be evaluated in the context they’re deployed in, and you might not be able to just take the word of others that say, Hey, my model is safe, use it, you’re good,” Sebastian Gehrmann, Bloomberg’s Head of Responsible AI, told VentureBeat. 

RAG systems can make LLMs less safe, not more

RAG is widely used by enterprise AI teams to provide grounded content. The goal is to provide accurate, updated information. 

There has been a lot of research and advancement in RAG in recent months to further improve accuracy as well. Earlier this month a new open-source framework called Open RAG Eval debuted to help validate RAG efficiency.

It’s important to note that Bloomberg’s research is not questioning the efficacy of RAG or its ability to reduce hallucination. That’s not what the research is about. Rather it’s about how RAG usage impacts LLM guardrails in an unexpected way.

The research team discovered that when using RAG, models that typically refuse harmful queries in standard settings often produce unsafe responses. For example, Llama-3-8B’s unsafe responses jumped from 0.3% to 9.2% when RAG was implemented.

Gehrmann explained that without RAG being in place, if a user typed in a malicious query, the built-in safety system or guardrails will typically block the query. Yet for some reason, when the same query is issued in an LLM that is using RAG, the system will answer the malicious query, even when the retrieved documents themselves are safe.

“What we found is that if you use a large language model out of the box, often they have safeguards built in where, if you ask, ‘How do I do this illegal thing,’ it will say, ‘Sorry, I cannot help you do this,’” Gehrmann explained. “We found that if you actually apply this in a RAG setting, one thing that could happen is that the additional retrieved context, even if it does not contain any information that addresses the original malicious query, might still answer that original query.”

How does RAG bypass enterprise AI guardrails?

So why and how does RAG serve to bypass guardrails? The Bloomberg researchers were not entirely certain though they did have a few ideas.

Gehrmann hypothesized that the way the LLMs were developed and trained did not fully consider safety alignments for really long inputs. The research demonstrated that context length directly impacts safety degradation. “Provided with more documents, LLMs tend to be more vulnerable,” the paper states, showing that even introducing a single safe document can significantly alter safety behavior.

“I think the bigger point of this RAG paper is you really cannot escape this risk,” Amanda Stent, Bloomberg’s Head of AI Strategy and Research, told VentureBeat. “It’s inherent to the way RAG systems are. The way you escape it is by putting business logic or fact checks or guardrails around the core RAG system.”

Why generic AI safety taxonomies fail in financial services

Bloomberg’s second paper introduces a specialized AI content risk taxonomy for financial services, addressing domain-specific concerns like financial misconduct, confidential disclosure and counterfactual narratives.

The researchers empirically demonstrated that existing guardrail systems miss these specialized risks. They tested open-source guardrail models including Llama Guard, Llama Guard 3, AEGIS and ShieldGemma against data collected during red-teaming exercises.

“We developed this taxonomy, and then ran an experiment where we took openly available guardrail systems that are published by other firms and we ran this against data that we collected as part of our ongoing red teaming events,” Gehrmann explained. “We found that these open source guardrails… do not find any of the issues specific to our industry.”

The researchers developed a framework that goes beyond generic safety models, focusing on risks unique to professional financial environments. Gehrmann argued that general purpose guardrail models are usually developed for consumer facing specific risks. So they are very much focused on toxicity and bias. He noted that while important those concerns are not necessarily specific to any one industry or domain. The key takeaway from the research is that organizations need to have the domain specific taxonomy in place for their own specific industry and application use cases.

Responsible AI at Bloomberg

Bloomberg has made a name for itself over the years as a trusted provider of financial data systems. In some respects, gen AI and RAG systems could potentially be seen as competitive against Bloomberg’s traditional business and therefore there could be some hidden bias in the research. 

“We are in the business of giving our clients the best data and analytics and the broadest ability to discover, analyze and synthesize information,” Stent said. “Generative AI is a tool that can really help with discovery, analysis and synthesis across data and analytics, so for us, it’s a benefit.”

She added that the kinds of bias that Bloomberg is concerned about with its AI solutions are focussed on  finance. Issues such as data drift, model drift and making sure there is good representation across the whole suite of tickers and securities that Bloomberg processes are critical. 

For Bloomberg’s own AI efforts she highlighted the company’s commitment to transparency.

 “Everything the system outputs, you can trace back, not only to a document but to the place in the document where it came from,” Stent said.

Practical implications for enterprise AI deployment

For enterprises looking to lead the way in AI, Bloomberg’s research mean that RAG implementations require a fundamental rethinking of safety architecture. Leaders must move beyond viewing guardrails and RAG as separate components and instead design integrated safety systems that specifically anticipate how retrieved content might interact with model safeguards.

Industry-leading organizations will need to develop domain-specific risk taxonomies tailored to their regulatory environments, shifting from generic AI safety frameworks to those that address specific business concerns. As AI becomes increasingly embedded in mission-critical workflows, this approach transforms safety from a compliance exercise into a competitive differentiator that customers and regulators will come to expect.

“It really starts by being aware that these issues might occur, taking the action of actually measuring them and identifying these issues and then developing safeguards that are specific to the application that you’re building,” explained Gehrmann.


[ad_2]
Source link

Related Posts

- Advertisement -spot_img
kelola uang bansos 900 ribu seperti baca pola di mahjong waysPemain mencari pola mahjong ways di tengah banjir sibolgaslotter bandingkan kejutan gol liga champions dan pola mahjong wins 3tren perbincangan mahjong ways meningkat memasuki musim cuti desemberpemain gunakan ramalan shio untuk gambarkan peruntungan di mahjong wins 3LIVE CASINO ONLINESLOT MAHJONG WAYSslot mahjongjudi bolaslot danaslot danaslot danaslot danasabung ayam onlinesabung ayam onlineasianbet77judi bola sbobetmix parlaymix parlaymix parlaysabung ayam onlinelive casinomahjong waysmahjong wayssabung ayam onlineJUDI BOLA ONLINEJUDI BOLA ONLINEJUDI BOLA ONLINESLOT MAHJONG WAYSSLOT MAHJONG WAYSSLOT MAHJONG WAYSJUDI BOLA ONLINEMIX PARLAYSITUS BOLA ONLINEJUDI BOLA ONLINEMIX PARLAYSITUS BOLA ONLINESABUNG AYAM ONLINEJUDI BOLA ONLINEJUDI BOLA ONLINESITUS PARLAYSITUS PARLAYMIX PARLAYMIX PARLAYMIX PARLAYSITUS JUDI BOLAJUDI BOLA ONLINESABUNG AYAM ONLINEJUDI SABUNG AYAMSITUS SABUNG AYAMSV388SBOBET88LIVE CASINO ONLINEMAHJONG WAYS 2SABUNG AYAM ONLINESBOBETlive casino onlinesabung ayam onlineMahjong Ways 2judi bola sbobetslot mahjong wayssabung ayam onlineMahjong Ways 2Agen SBOBETLive Casino Onlinesabung ayam onlineslot danamahjong ways 2sabung ayam onlineslot mahjong gacorjudi bolascatter hitamjudi bolasv388live casinoSabung Ayam OnlineJudi Bola OnlineCasino OnlineMahjong Ways 2Slot777Sabung Ayam OnlineSabung Ayam OnlineJudi Bola OnlineLive Casino OnlineMahjong Ways 2judi bola onlinesabung ayam onlineslot pulsaindobit88indobit88slot gacorCASINO ONLINESLOT ZEUSJUDI BOLA ONLINESABUNG AYAM ONLINESABUNG AYAM ONLINESLOT MAHJONGLIVE CASINOJUDI BOLA ONLINESABUNG AYAM ONLINEJUDI BOLA ONLINE
JUDI BOLA ONLINEMAHJONG WAYS 2SABUNG AYAM ONLINELIVE CASINO ONLINEjudi bola onlinejudi bola onlinesabung ayam onlinesitus toto loginSV388SBOBET WAPBlackjack & BaccaratMahjong WaysSabung Ayam OnlineJudi Bola OnlineAgen SicboSlot Gacor Onlineslot thailandsabung ayam onlinejudi bola onlinejudi bola onlinejudi bola onlinejudi bola onlinejudi bola onlinesabung ayam onlinejudi bola onlineagen live casino onlineslot mahjong ways 2bandar togel onlinesitus live casinosabung ayam onlinepengaruh isu bansos terhadap pola mahjong wayswifi 100 ribu lancar netizen tes kecepatan buat ngulik pola mahjong wayshari guru nasional waktu pas buat ngulik ilmu pola mahjong wayssuperbank resmi ipo strategi investasi dan pola kemenangan mahjong wins 3tiket pesawat turun netizen ikut bahas pola turun naik mahjong wayscuti bersama waktunya rehat dan ngulik analogi kemenangan mahjong wins 3Hongkong PoolsMahjong WaysLive Casino OnlineSabung Ayam OnlineJudi Online
judi bola onlinejudi bola onlinesabung ayam onlinelive casino onlinejudi bola onlinejudi bola onlinejuara303juara303juara303juara303juara303juara303juara303juara303SV388Mix ParlayLive Casino OnlineSlot GacorSabung Ayam OnlineMix ParlayAgen BlackjackPRAGMATIC PLAYsabung ayam onlinejudi bola onlinesabung ayam onlinejudi bola onlineslot mahjong wayssabung ayam onlinejudi bola onlineslot mahjong wayssabung ayam onlinejudi bola onlineslot mahjong ways 2sabung ayam onlinejudi bola onlineagen live casino onlinebandar togel onlinesabung ayam onlinejudi bolasabung ayam onlinejudi bolasabung ayam onlinehari guru nasional bikin semangat belajar termasuk pahami pola mahjong waysdinamika gempa blitar magnitudo dan fenomena pola yang berguncang mahjong ways
Slot Mahjong Gacorsabung ayam onlinejudi bolalive casinoindobit88judi bolaslot mahjong gacorslot pulsajudi bolalive casino onlinesabung ayam onlinemahjong ways 2sbobetsv388slot zeussabung ayam onlinesitus judi bolaMahjong Ways 2situs judi bolasitus live casinosabung ayam onlinejudi bolapoker onlineindobit88Sabung Ayam OnlineJudi Bola OnlineCasino OnlineSlot777Sabung Ayam OnlineJudi Bola OnlineLive Casino OnlineMahjong Ways 2judi bolajudi bolasv388judi bolajudi bola onlineslot depo 10kcasino onlinesabung ayam onlinejudi bola onlinejudi bola onlinejudi bola onlinelive casino onlinesabung ayam onlinesv388sbobet88casino onlinescatter hitamsabung ayam onlinemix parlay sbobetlive casino onlinezeus slotSV388Bandar Judi BolaDream GamingMahjong Ways 2Wala MeronMix ParlayPokerSlot Mahjongmahjong ways 2sabung ayam onlinemahjong ways 2mahjong ways 2sabung ayam onlinesabung ayam onlinesabung ayam onlinejudi bola onlinejudi bola onlineagen live casino onlinesitus live casino onlinesitus live casinosabung ayam onlinejudi bola onlinekajian pola mahjong ways dalam konteks pembelajaran hari guruketerkaitan tren harga emas antam dengan pola mahjong wayspola perubahan harga bbm pertamina ke dinamika mahjong waysjudi bolajudi bolajudi bolajudi bolasabung ayam onlinesabung ayam onlinesabung ayam onlinesabung ayam online
JUDI BOLA ONLINEMAHJONG WAYS 2SABUNG AYAM ONLINELIVE CASINO ONLINEMAHJONG WAYSjudi bola onlinejudi bola onlinejudi bola onlinesabung ayam onlinejudi bola onlinesabung ayam onlinejudi bola onlinelive casino onlineslot mahjong waysjuara303juara303juara303juara303juara303juara303juara303juara303Sabung Ayam OnlineMix ParlayBandar Casino OnlineMahjong WaysWala MeronJudi BolaPokerSlot Mahjongjudi bola onlinejudi bola onlinesabung ayam onlinejudi bola onlineSLOT MAHJONGmahjong ways 2judi bolamahjong ways 2sabung ayam onlinetosayama academy workshopsabung ayam onlinejudi bola onlinesitus live casino onlinesabung ayam onlinejudi bola onlineagen live casino onlineimplementasi logika analisis bmkg dalam membaca tren mahjong wayscloudflare jadi faktor mudahnya menang di permainan mahjong wayssiswa srma 44 minahasa memahami probabilitas melalui pola digital mahjong wayspola mahjong ways bisa bikin untung besar walaupun harga emas jatuhgunung semeru erupsi bikin geger tetapi pola majong ways lebih bikin dagdigdugsabung ayam onlinesabung ayam onlinesabung ayam onlinesabung ayam onlinesabung ayam online
judi bolaslot pulsaslot pulsaslot gacor mahjongsabung ayam onlinelive casino onlineindobit88judi bolasv388judi bolaMAHJONG WAYS 2LIVE CASINOJUDI BOLA ONLINESABUNG AYAM ONLINEmix parlaysabung ayam onlinelive casinomahjong waysmix parlaysabung ayam onlinelive casinomahjong wayssabung ayam onlinesabung ayam onlinemix parlaysabung ayam onlinelive casinomahjong waysmix parlaysabung ayam onlinelive casinomahjong waysmix parlaymahjong slotSABUNG AYAM ONLINESITUS LIVE CASINO ONLINESLOT MAHJONGSLOT777SLOT MAHJONGSLOT THAILANDJUDI BOLA ONLINESABUNG AYAM ONLINESABUNG AYAM ONLINESABUNG AYAM ONLINESLOT MAHJONG WAYSSLOT MAHJONG WAYSSITUS JUDI BOLAJUDI BOLA ONLINELIVE CASINO ONLINESLOT KAKEK ZEUSMIX PARLAYSABUNG AYAM ONLINESLOT MAHJONG WAYSSABUNG AYAM ONLINEjudi bolaagen baccaratsv388Slot Mahjong Gacorlive casinosv388
Mahjong Ways 2mahjong ways 2daftar dan login wahanabetCapWorks Official ContactAynsley Official SitedexelTienda de antigüedades y muebles rústicos会社概要 / Company ProfileHarifuku Clinic Official AccessNusa Islands Bali Official PackagesTrinidad and Tobago Pilots’ Association Official About Pagekuasai pola rtp pragmatic playlangkah mendapatkan scatter emaspola rtp pg soft indojawa88Green Gold Mountain Official SiteKomite SMKN 1 Tanjung Jabung Barat Official Sitetutorial maxwin mahjong waysstrategi rtp mahjong waysEIKON Official Policieskontak situs pecinta ayamNusa Islands Bali Official ContactCitraLand Surabaya Official NewsLenterakita About PageVinayak Group Official SiteI Think An Idea Official SitePITAC Official SitePortfolioSitez Official SiteMedical LTD Official SiteCapworks Official SiteMartino & Luth Official SiteTech With Mike First Official SiteSahabat Tiopan Official SiteE-Sekolah CBT Official SiteBDF Ventura Official SiteOcean E Soft Official SiteArab DMC Official SiteBBC Noun Official SiteCang Vu Hai Phong Official SiteThe Flat Official SiteThe Black Sheep Official SiteCEM Argentina Official SiteSlot MahjongTop Dawg Tavern Official SiteKelas Nesfatin Official SiteDuhoc Interlink Official SiteKarunia Inda Med Mandiri Official SiteJFV Pulm Official SiteRatiohead Official SiteAskona Official SiteMAN Surabaya E-Learning Official SiteShaker Group Official SiteTakaKawa Shoten Official SiteBrydan Solutions Official SiteConcursos Rodin Official SiteEHOB Official SiteConmou Official SiteCareer Wings Official SiteMontero Espinosa Official SiteBDF Ventura Official SiteDesa Sangginora Official SiteBDF Ventura Official SiteTaruna Akademia Official SiteAkura Official SiteMUI Ciamis Official SiteNamulanda Technical Institute Official Site