Close Menu
Beverly Hills Examiner

    Subscribe to Updates

    Get the latest creative news from FooBar about art, design and business.

    What's Hot

    Greta Van Fleet Share Cryptic Video + Fans Think They Broke Up

    May 4, 2026

    Britney Spears pleads guilty to lesser ‘wet reckless’ charge in DUI case to avoid jail time

    May 4, 2026

    Young Voters Swing Back To Democrats As Republicans Are In Big Trouble

    May 4, 2026
    Facebook X (Twitter) Instagram
    Beverly Hills Examiner
    • Home
    • US News
    • Politics
    • Business
    • Science
    • Technology
    • Lifestyle
    • Music
    • Television
    • Film
    • Books
    • Contact
      • About
      • Amazon Disclaimer
      • DMCA / Copyrights Disclaimer
      • Terms and Conditions
      • Privacy Policy
    Beverly Hills Examiner
    Home»Science»Why DeepSeek’s AI Model Just Became the Top-Rated App in the U.S.
    Science

    Why DeepSeek’s AI Model Just Became the Top-Rated App in the U.S.

    By AdminJanuary 28, 2025
    Facebook Twitter Pinterest LinkedIn WhatsApp Email Reddit Telegram
    Why DeepSeek’s AI Model Just Became the Top-Rated App in the U.S.


    January 27, 2025

    3 min read

    Why DeepSeek’s AI Model Just Became the Top-Rated App in the U.S.

    A Chinese start-up has stunned the technology industry—and financial markets—with a cheaper, lower-tech AI assistant that matches the state of the art

    By Stephanie Pappas edited by Jeanna Bryner

    Person manipulating big data behind a screen.

    DeepSeek’s artificial intelligence assistant made big waves Monday, becoming the top-rated app in the Apple Store and sending tech stocks into a downward tumble. What’s all the fuss about?

    The Chinese start-up, DeepSeek, surprised the tech industry with a new model that rivals the abilities of OpenAI’s most recent model—with far less investment and using reduced-capacity chips. The U.S. bans exports of state-of-the-art computer chips to China and limits sales of chipmaking equipment. DeepSeek, based in the eastern Chinese city of Hangzhou, reportedly had a stockpile of high-performance Nvidia A100 chips from times prior to the ban—so its engineers could have used those to develop the model. But in a key breakthrough, the start-up says it instead used much lower-powered Nvidia H800 chips to train the new model, dubbed DeepSeek-R1.

    “We’ve seen up to now that the success of large tech companies working in AI was measured in how much money they raised, not necessarily in what the technology actually was,” says Ashlesha Nesarikar, the CEO of AI company Plano Intelligence, Inc. “I think we’ll be paying a lot more attention to what tech is underpinning these companies’ different products.”


    On supporting science journalism

    If you’re enjoying this article, consider supporting our award-winning journalism by subscribing. By purchasing a subscription you are helping to ensure the future of impactful stories about the discoveries and ideas shaping our world today.


    On common AI tests in mathematics and coding, DeepSeek-R1 matched the scores of Open AI’s o1 model, according to VentureBeat. U.S. companies don’t disclose the cost of training their own large language models (LLMs), the systems that undergird popular chatbots such as ChatGPT. But OpenAI CEO Sam Altman told an audience at MIT in 2023 that training ChatGPT-4 cost over $100 million. DeepSeek-R1 is free for users to download, while the comparable version of ChatGPT costs $200 a month.

    DeepSeek’s $6 million number doesn’t necessarily reflect the cost of building a LLM from scratch, Nesarikar says; that cost may represent a fine-tuning of this latest version. Nevertheless, she says, the model’s improved energy efficiency would make AI more accessible to more people in more industries. The increase in efficiency could be good news when it comes to AI’s environmental impact, as the computation cost of generating new data with an LLM is four to five times higher than a typical search engine query.

    Because it requires less computational power, the cost of running DeepSeek-R1 is a tenth of the cost of similar competitors, says Hanchang Cao, an incoming assistant professor in Information Systems and Operations Management at Emory University. “For academic researchers or start-ups, this difference in the cost really means a lot,” Cao says.

    DeepSeek achieved its efficiency in several ways, says Anil Ananthaswamy, author of Why Machines Learn: The Elegant Math Behind Modern AI. The model has 670 billion parameters, or variables it learns from during training, making it the largest open-source large language model yet, Ananthaswamy explains. But the model uses an architecture called “mixture of experts” so that only a relevant fraction of these parameters—tens of billions instead of hundreds of billions—are activated for any given query. This cuts down on computing costs. The DeepSeek LLM also uses a method called multi-head latent attention to boost the efficiency of its inferences; and instead of predicting an answer word-by-word, it generates multiple words at once.

    The model further differs from others like o1 in how it reinforces learning during training. While many LLMs have an external “critic” model that runs alongside them, correcting errors and nudging the LLM toward verified answers, DeepSeek-R1 uses a set of rules internal to the model to teach it which of the possible answers it generates is best. “DeepSeek has streamlined that process,” Ananthaswamy says.

    Another important aspect of DeepSeek-R1 is that the company has made the code behind the product open-source, Ananthaswamy says. (The training data remains proprietary.) This means that the company’s claims can be checked. If the model is as computationally efficient as DeepSeek claims, he says, it will probably open up new avenues for researchers who use AI in their work to do so more quickly and cheaply. It will also enable more research into the inner workings of LLMs themselves.

    “One of the big things has been this divide that has opened up between academia and industry because academia has been unable to work with these really large models or do research in any meaningful way,” Ananthaswamy says. “But something like this, it’s within the reach of academia now, because you have the code.”



    Original Source Link

    Share. Facebook Twitter Pinterest LinkedIn WhatsApp Email Reddit Telegram
    Previous ArticleDouble Indemnity and the banality of evil
    Next Article Alice&Bob raises $104M, says quantum computers based on cat qubits will be ready by 2030

    RELATED POSTS

    NASA just dropped more than 12,000 Artemis II photos—here’s how to see them

    May 4, 2026

    Smart underwear detects lactose intolerance by tracking your farts

    May 4, 2026

    This Treatment Could Reverse Osteoarthritis Joint Damage With a Single Injection

    May 3, 2026

    Why the FDA rejected a ‘breakthrough’ melanoma drug

    May 3, 2026

    2026 will be the hottest year on record, leading scientist predicts

    May 2, 2026

    The Next Alzheimer’s Breakthrough Will Take More Than Just Science

    May 2, 2026
    latest posts

    Greta Van Fleet Share Cryptic Video + Fans Think They Broke Up

    Greta Van Fleet just shared a cryptic video on social media and some fans think…

    Britney Spears pleads guilty to lesser ‘wet reckless’ charge in DUI case to avoid jail time

    May 4, 2026

    Young Voters Swing Back To Democrats As Republicans Are In Big Trouble

    May 4, 2026

    NBA playoffs games to overlap next two nights, making the viewing experience even worse

    May 4, 2026

    What to Know About Sony’s $7.85 Million PlayStation Settlement

    May 4, 2026

    NASA just dropped more than 12,000 Artemis II photos—here’s how to see them

    May 4, 2026

    Wild Foxes review – a confident riff on the…

    May 4, 2026
    Categories
    • Books (1,220)
    • Business (6,126)
    • Cover Story (4)
    • Film (6,064)
    • Lifestyle (4,149)
    • Music (6,134)
    • Politics (6,126)
    • Science (5,475)
    • Technology (6,060)
    • Television (5,753)
    • Uncategorized (3)
    • US News (6,111)
    popular posts

    TV Ratings for Friday, June 17: ‘Dateline NBC’ Amber Heard Interview

    In the latest TV ratings, Dateline NBC‘s sitdown with Amber Heard on Friday night drew…

    Voyager 2 Gets a Life-Extending Power Boost in Deep Space

    May 16, 2023

    Jazz legend Nat King Cole’s great-nephew, Tracy Cole, 31, stabbed to death in Atlanta: ‘Everybody loved him’

    September 17, 2023

    How a Facial Plastic Surgeon Treats Bell’s Palsy

    August 15, 2024
    Archives
    Browse By Category
    • Books (1,220)
    • Business (6,126)
    • Cover Story (4)
    • Film (6,064)
    • Lifestyle (4,149)
    • Music (6,134)
    • Politics (6,126)
    • Science (5,475)
    • Technology (6,060)
    • Television (5,753)
    • Uncategorized (3)
    • US News (6,111)
    About Us

    We are a creativity led international team with a digital soul. Our work is a custom built by the storytellers and strategists with a flair for exploiting the latest advancements in media and technology.

    Most of all, we stand behind our ideas and believe in creativity as the most powerful force in business.

    What makes us Different

    We care. We collaborate. We do great work. And we do it with a smile, because we’re pretty damn excited to do what we do. If you would like details on what else we can do visit out Contact page.

    Our Picks

    NASA just dropped more than 12,000 Artemis II photos—here’s how to see them

    May 4, 2026

    Wild Foxes review – a confident riff on the…

    May 4, 2026

    ‘The View’ Alyssa Griffin Reveals Son Had Major Surgery, In ICU

    May 4, 2026
    © 2026 Beverly Hills Examiner. All rights reserved. All articles, images, product names, logos, and brands are property of their respective owners. All company, product and service names used in this website are for identification purposes only. Use of these names, logos, and brands does not imply endorsement unless specified. By using this site, you agree to the Terms & Conditions and Privacy Policy.

    Type above and press Enter to search. Press Esc to cancel.

    We use cookies on our website to give you the most relevant experience by remembering your preferences and repeat visits. By clicking “Accept All”, you consent to the use of ALL the cookies. However, you may visit "Cookie Settings" to provide a controlled consent.
    Cookie SettingsAccept All
    Manage consent

    Privacy Overview

    This website uses cookies to improve your experience while you navigate through the website. Out of these, the cookies that are categorized as necessary are stored on your browser as they are essential for the working of basic functionalities of the website. We also use third-party cookies that help us analyze and understand how you use this website. These cookies will be stored in your browser only with your consent. You also have the option to opt-out of these cookies. But opting out of some of these cookies may affect your browsing experience.
    Necessary
    Always Enabled
    Necessary cookies are absolutely essential for the website to function properly. These cookies ensure basic functionalities and security features of the website, anonymously.
    CookieDurationDescription
    cookielawinfo-checkbox-analytics11 monthsThis cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Analytics".
    cookielawinfo-checkbox-functional11 monthsThe cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Functional".
    cookielawinfo-checkbox-necessary11 monthsThis cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary".
    cookielawinfo-checkbox-others11 monthsThis cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Other.
    cookielawinfo-checkbox-performance11 monthsThis cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Performance".
    viewed_cookie_policy11 monthsThe cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.
    Functional
    Functional cookies help to perform certain functionalities like sharing the content of the website on social media platforms, collect feedbacks, and other third-party features.
    Performance
    Performance cookies are used to understand and analyze the key performance indexes of the website which helps in delivering a better user experience for the visitors.
    Analytics
    Analytical cookies are used to understand how visitors interact with the website. These cookies help provide information on metrics the number of visitors, bounce rate, traffic source, etc.
    Advertisement
    Advertisement cookies are used to provide visitors with relevant ads and marketing campaigns. These cookies track visitors across websites and collect information to provide customized ads.
    Others
    Other uncategorized cookies are those that are being analyzed and have not been classified into a category as yet.
    SAVE & ACCEPT