Subscribe to Updates

    Get the latest creative news from FooBar about art, design and business.

    What's Hot

    As Anthropic suspends access to new models, India debates its AI future

    June 14, 2026

    Meta reportedly moves to unwind $2B Manus deal after Beijing’s demand

    June 14, 2026

    KPMG pulls report on AI usage due to apparent hallucinations

    June 13, 2026
    Facebook Twitter Instagram
    • Tech
    • Gadgets
    • Spotlight
    • Gaming
    Facebook Twitter Instagram
    iGadgets TechiGadgets Tech
    Subscribe
    • Home
    • Gadgets
    • Insights
    • Apps

      As Anthropic suspends access to new models, India debates its AI future

      June 14, 2026

      Meta reportedly moves to unwind $2B Manus deal after Beijing’s demand

      June 14, 2026

      KPMG pulls report on AI usage due to apparent hallucinations

      June 13, 2026

      Amazon CEO reportedly raised Anthropic model concerns before government crackdown

      June 13, 2026

      This thin under-pillow speaker helped me fall asleep without earbuds

      June 13, 2026
    • Gear
    • Mobiles
      1. Tech
      2. Gadgets
      3. Insights
      4. View All

      The FCC Wants to Kill Burner Phones

      June 13, 2026

      EcoFlow PowerOcean Battery Review: Cutting My Bill in Half

      June 13, 2026

      Meet the New Dyson Vacuums: V16 Piston Animal, V10 Konical, V8 Cyclone (2026)

      June 13, 2026

      How Can Soccer Players Bend Their Shots in Midair?

      June 13, 2026

      March Update May Have Weakened The Haptics For Pixel 6 Users

      April 2, 2022

      Project 'Diamond' Is The Galaxy S23, Not A Rollable Smartphone

      April 2, 2022

      The At A Glance Widget Is More Useful After March Update

      April 2, 2022

      Pre-Order The OnePlus 10 Pro For Just $1 In The US

      April 2, 2022

      Motorola Edge+ Review: It Checks A Lot Of Boxes

      April 2, 2022

      This Smartphone Concept Design Is Different… In A Good Way

      April 2, 2022

      Twitter Just Made Searching Your Direct Messages Better

      April 2, 2022

      That Netflix Price Hike Is Starting To Take Place

      April 2, 2022

      Latest Huawei Mobiles P50 and P50 Pro Feature Kirin Chips

      January 15, 2021

      Samsung Galaxy M62 Benchmarked with Galaxy Note10’s Chipset

      January 15, 2021
      9.1

      Review: T-Mobile Winning 5G Race Around the World

      January 15, 2021
      8.9

      Samsung Galaxy S21 Ultra Review: the New King of Android Phones

      January 15, 2021
    • Computing
    iGadgets TechiGadgets Tech
    Home»Apps»Can tech companies learn to love cheaper AI models? 
    Apps

    Can tech companies learn to love cheaper AI models? 

    adminBy adminJune 9, 2026No Comments3 Mins Read
    Facebook Twitter Pinterest LinkedIn Tumblr Email
    Modern data center with servers with lights on them.
    Share
    Facebook Twitter LinkedIn Pinterest Email

    The AI boom has been built on a basic assumption: bigger models are more powerful, and the most powerful models win. Now, the industry is about to learn what happens if that assumption starts to break.  

    Mounting costs have already pressured users to give smaller and cheaper models a second look. This cost-conscious model-shopping is new and it’s unclear how it will affect the industry, but the impact is likely to be significant. 

    One prediction, laid out best by Coinbase co-founder Brian Armstrong, is that it will result in the vast majority of tasks shifting to cheaper models. 

    “Demand for intelligence is near infinite, but 80% of workloads will be running on 99% cheaper models within 12-18 months,” Armstrong wrote on X. “20% of workloads will still run on latest gen models where IQ maxing is important.” 

    It’s hard to overstate what a significant shift it will be for the AI industry if Armstrong’s prediction comes true.  

    Before now, most AI companies have competed on quality, which has meant defaulting to the most advanced available model. If those same jobs can be handled by cheaper models without affecting quality, it would mean a massive shift in the economics of AI. And critically, much of the savings would be coming out of the pockets of the big labs, dealing a financial blow to OpenAI and Anthropic just as they’re heading for their IPOs. 

    It’s a potentially seismic change in the industry, resting on one basic question: Are companies ready to switch to smaller models? 

    Initial tests suggest that, when the system is arranged right, cheaper models could sub in without any sacrifice in quality. In a recent test by the legal AI tool Harvey, the company was able to reduce inference costs by 3x without reducing quality. The test, performed in partnership with the inference platform Fireworks AI, combined Claude Opus and Fireworks’ GLM 5.1, and shifted to Opus for the most intensive tasks. The result was a significantly lower load in terms of server time and overall cost. 

    “Quality comes first, and in legal it always will,” Harvey co-founder Gabe Pereyra told TechCrunch, referring to the AI legal services his startup provides. “However, the definition of quality is evolving from simply using the most powerful model for everything, to using the best model that gets the right answer most efficiently.”

    This trend is often framed in terms of major labs versus Chinese models or open-weight ones, but that misses the bigger point. The real divide isn’t between proprietary and open models; it’s between large models and small ones. You can save money by switching from GPT-5.5 to DeepSeek’s V4 Flash, but switching to GPT-5.4-mini works just as well.  

    There’s an active price war going on between in-house inference from the big labs and independently served open-weight models. For the bigger question of small versus large, it doesn’t really matter which kind of small model wins out.  

    All of this might seem obvious — of course you shouldn’t use more compute than necessary — but it runs counter to the scaling-first approach that has dominated the industry until now. Inspired by the bitter lesson, labs have leaned hard into training the most compute-intensive models possible, pushing the frontier of what AI models can do. With prices heavily subsidized by investors, clients had no reason to choose anything but the most advanced option.

    With token prices rising and subsidies slowing down, users are facing cost pressure for the first time. We don’t know whether the new cost pressure will actually drive enterprise users to smaller models. They could just as easily economize by making fewer calls, using less context, or simply giving up on the least promising deployments. 

    But if it turns out that most deployments can be run just as well on a smaller model, it could put a serious damper on the growing demand for inference – and raise new questions about how to justify the cost of training a frontier model. 

    When you purchase through links in our articles, we may earn a small commission. This doesn’t affect our editorial independence.

    AI,TC,ai models,Anthropic,Harvey,OpenAIai models,Anthropic,Harvey,OpenAI#tech #companies #learn #love #cheaper #models1781031954

    ai models Anthropic cheaper companies Harvey learn Love models OpenAI tech
    Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
    admin
    • Website
    • Tumblr

    Related Posts

    As Anthropic suspends access to new models, India debates its AI future

    June 14, 2026

    Meta reportedly moves to unwind $2B Manus deal after Beijing’s demand

    June 14, 2026

    KPMG pulls report on AI usage due to apparent hallucinations

    June 13, 2026
    Add A Comment

    Leave A Reply Cancel Reply

    Editors Picks
    8.5

    Apple Planning Big Mac Redesign and Half-Sized Old Mac

    January 5, 2021

    Autonomous Driving Startup Attracts Chinese Investor

    January 5, 2021

    Onboard Cameras Allow Disabled Quadcopters to Fly

    January 5, 2021
    Top Reviews
    9.1

    Review: T-Mobile Winning 5G Race Around the World

    By admin
    8.9

    Samsung Galaxy S21 Ultra Review: the New King of Android Phones

    By admin
    8.9

    Xiaomi Mi 10: New Variant with Snapdragon 870 Review

    By admin
    Advertisement
    Demo
    iGadgets Tech
    Facebook Twitter Instagram Pinterest Vimeo YouTube
    • Home
    • Tech
    • Gadgets
    • Mobiles
    • Our Authors
    © 2026 ThemeSphere. Designed by WPfastworld.
    "korean kbj​ "korean bj "koreanbj​

    Type above and press Enter to search. Press Esc to cancel.