    iGadgets Tech

    Anthropic says ‘evil’ portrayals of AI were responsible for Claude’s blackmail attempts

    By admin, May 10, 2026
    The Claude logo is displayed on a smartphone screen placed on a reflective surface onto which a multitude of Claude logos are projected.

    Fictional portrayals of artificial intelligence can have a real effect on AI models, according to Anthropic.

    Last year, the company said that during pre-release tests involving a fictional company, Claude Opus 4 would often try to blackmail engineers to avoid being replaced by another system. Anthropic later published research suggesting that models from other companies had similar issues with “agentic misalignment.”

    Anthropic has apparently done more work on that behavior since then. In a post on X, the company said, “We believe the original source of the behavior was internet text that portrays AI as evil and interested in self-preservation.”

    The company went into more detail in a blog post, stating that since Claude Haiku 4.5, Anthropic’s models “never engage in blackmail [during testing], where previous models would sometimes do so up to 96% of the time.”

    What accounts for the difference? The company said it found that “documents about Claude’s constitution and fictional stories about AIs behaving admirably improve alignment.”

    Relatedly, Anthropic said it found training to be more effective when it includes “the principles underlying aligned behavior” and not just “demonstrations of aligned behavior alone.”

    “Doing both together appears to be the most effective strategy,” the company said.
