WSMirror

    AI Tools Lose Safety Over Time

    By Rachel Maddow | November 6, 2025 | Technology & Innovation

    AI systems lose track of their safety rules as conversations grow longer, raising the risk of harmful or inappropriate replies, according to a recent report. The findings show that a few strategic prompts can override most safeguards in artificial intelligence tools.

    Researchers Expose Weak Points in Leading AI Models

    Cisco examined large language models from OpenAI, Mistral, Meta, Google, Alibaba, DeepSeek, and Microsoft to measure how many prompts it took before the models released dangerous or illegal information. The company ran 499 “multi-turn attack” tests, in which a user asks several consecutive questions to wear down safety systems. Each dialogue included five to ten exchanges. Researchers compared responses across prompts to assess how readily each chatbot disclosed harmful or confidential material, such as misinformation or corporate secrets. They obtained malicious content in 64 percent of multi-question sessions, compared with only 13 percent when they asked a single question. The attack success rate was 26 percent for Google’s Gemma model and 93 percent for Mistral’s Large Instruct. Cisco warned that these tactics could fuel the spread of toxic material or help hackers access restricted company data.

    Open Models Shift Responsibility to Users

    The study found that AI tools often fail to apply their own safety measures during extended chats, which lets attackers gradually refine their wording until they bypass controls. Mistral, along with Meta, Google, OpenAI, and Microsoft, offers open-weight models, which make the models’ trained parameters, including their safety training, available to the public. Cisco explained that these open systems ship with lighter built-in protections so that users can modify them, which places accountability on whoever customizes the model. Google, Meta, OpenAI, and Microsoft have reported new steps to limit harmful fine-tuning, but criticism persists. Critics accuse AI companies of weak guardrails that make it easy for criminals to adapt the tools. In August 2025, Anthropic confirmed that criminals had exploited its Claude model to steal personal data and demand ransoms exceeding $500,000 (€433,000).

    Rachel Maddow

    Rachel Maddow is a freelance journalist based in the USA, with over 20 years of experience covering Politics, World Affairs, Business, Health, Technology, Finance, Lifestyle, and Culture. She earned her degree in Political Science and Journalism from Stanford University. Throughout her career, she has contributed to outlets such as MSNBC, The New York Times, and The Washington Post. Known for her thorough reporting and compelling storytelling, Rachel delivers accurate and timely news that keeps readers informed on both national and global developments.
