Close Menu
Beverly Hills Examiner

    Subscribe to Updates

    Get the latest creative news from FooBar about art, design and business.

    What's Hot

    Eddie Vedder Covers “My City of Ruins” Following Trump’s Attack on Springsteen

    May 18, 2025

    Scott Bessent says tariff uncertainty is a tactic — otherwise countries ‘would play us in the negotiations’

    May 18, 2025

    Trump Is Now Trying To Destroy The Same Pro-Palestinian Americans Who Voted For Him

    May 18, 2025
    Facebook X (Twitter) Instagram
    Beverly Hills Examiner
    • Home
    • US News
    • Politics
    • Business
    • Science
    • Technology
    • Lifestyle
    • Music
    • Television
    • Film
    • Books
    • Contact
      • About
      • Amazon Disclaimer
      • DMCA / Copyrights Disclaimer
      • Terms and Conditions
      • Privacy Policy
    Beverly Hills Examiner
    Home»Science»How Game Theory Can Make AI More Reliable
    Science

    How Game Theory Can Make AI More Reliable

    By June 10, 2024
    Facebook Twitter Pinterest LinkedIn WhatsApp Email Reddit Telegram
    How Game Theory Can Make AI More Reliable


    Posing a far greater challenge for AI researchers was the game of Diplomacy—a favorite of politicians like John F. Kennedy and Henry Kissinger. Instead of just two opponents, the game features seven players whose motives can be hard to read. To win, a player must negotiate, forging cooperative arrangements that anyone could breach at any time. Diplomacy is so complex that a group from Meta was pleased when, in 2022, its AI program Cicero developed “human-level play” over the course of 40 games. While it did not vanquish the world champion, Cicero did well enough to place in the top 10 percent against human participants.

    During the project, Jacob—a member of the Meta team—was struck by the fact that Cicero relied on a language model to generate its dialog with other players. He sensed untapped potential. The team’s goal, he said, “was to build the best language model we could for the purposes of playing this game.” But what if instead they focused on building the best game they could to improve the performance of large language models?

    Consensual Interactions

    In 2023, Jacob began to pursue that question at MIT, working with Yikang Shen, Gabriele Farina, and his adviser, Jacob Andreas, on what would become the consensus game. The core idea came from imagining a conversation between two people as a cooperative game, where success occurs when a listener understands what a speaker is trying to convey. In particular, the consensus game is designed to align the language model’s two systems—the generator, which handles generative questions, and the discriminator, which handles discriminative ones.

    After a few months of stops and starts, the team built this principle up into a full game. First, the generator receives a question. It can come from a human or from a preexisting list. For example, “Where was Barack Obama born?” The generator then gets some candidate responses, let’s say Honolulu, Chicago, and Nairobi. Again, these options can come from a human, a list, or a search carried out by the language model itself.

    But before answering, the generator is also told whether it should answer the question correctly or incorrectly, depending on the results of a fair coin toss.

    If it’s heads, then the machine attempts to answer correctly. The generator sends the original question, along with its chosen response, to the discriminator. If the discriminator determines that the generator intentionally sent the correct response, they each get one point, as a kind of incentive.

    If the coin lands on tails, the generator sends what it thinks is the wrong answer. If the discriminator decides it was deliberately given the wrong response, they both get a point again. The idea here is to incentivize agreement. “It’s like teaching a dog a trick,” Jacob explained. “You give them a treat when they do the right thing.”

    The generator and discriminator also each start with some initial “beliefs.” These take the form of a probability distribution related to the different choices. For example, the generator may believe, based on the information it has gleaned from the internet, that there’s an 80 percent chance Obama was born in Honolulu, a 10 percent chance he was born in Chicago, a 5 percent chance of Nairobi, and a 5 percent chance of other places. The discriminator may start off with a different distribution. While the two “players” are still rewarded for reaching agreement, they also get docked points for deviating too far from their original convictions. That arrangement encourages the players to incorporate their knowledge of the world—again drawn from the internet—into their responses, which should make the model more accurate. Without something like this, they might agree on a totally wrong answer like Delhi, but still rack up points.



    Original Source Link

    Share. Facebook Twitter Pinterest LinkedIn WhatsApp Email Reddit Telegram
    Previous ArticleChina Box Office: ‘Furiosa,’ ‘Civil War’ Fizzle
    Next Article Apple WWDC 2024 Live Blog: All the News as It Happens

    RELATED POSTS

    Babies start showing empathy even before they can speak

    May 18, 2025

    Blocked From Selling Off-Brand Ozempic, Telehealth Startups Embrace a Less Effective Drug

    May 18, 2025

    First Personalized CRISPR Treatment Gives Baby New Lease on Life

    May 17, 2025

    US east coast faces rising seas as crucial Atlantic current slows

    May 17, 2025

    A Baby Received a Custom Crispr Treatment in Record Time

    May 16, 2025

    ‘Supersonic’ Planes Could Make a Comeback in the U.S. after Decades-Long Ban

    May 16, 2025
    latest posts

    Eddie Vedder Covers “My City of Ruins” Following Trump’s Attack on Springsteen

    During Pearl Jam’s concert in Pittsburgh on Friday, frontman Eddie Vedder performed a solo cover…

    Scott Bessent says tariff uncertainty is a tactic — otherwise countries ‘would play us in the negotiations’

    May 18, 2025

    Trump Is Now Trying To Destroy The Same Pro-Palestinian Americans Who Voted For Him

    May 18, 2025

    Brown line on fingernail helped catch cancer early, thanks to TikTok video

    May 18, 2025

    Heybike’s Alpha step-through e-bike is an affordable, all-terrain dreamboat

    May 18, 2025

    Babies start showing empathy even before they can speak

    May 18, 2025

    Inside The Hollywood Reporter’s ‘Die, My Love’ Cannes Premiere Party

    May 18, 2025
    Categories
    • Books (523)
    • Business (5,427)
    • Film (5,364)
    • Lifestyle (3,469)
    • Music (5,418)
    • Politics (5,413)
    • Science (4,775)
    • Technology (5,361)
    • Television (5,037)
    • Uncategorized (1)
    • US News (5,415)
    popular posts

    The Feast review ― a stylish but flat socio-horror

    The Feast review ― a stylish but flat socio-horror About Little White Lies Little White…

    Some of the many Boris Johnson scandals that rocked Britain

    July 6, 2022

    Power Book III: Raising Kanan Season 3 Episode 2 Review: Flipmode

    December 9, 2023

    Former ‘Glee’ Star Blake Jenner Arrested on DUI Charge

    July 18, 2022
    Archives
    Browse By Category
    • Books (523)
    • Business (5,427)
    • Film (5,364)
    • Lifestyle (3,469)
    • Music (5,418)
    • Politics (5,413)
    • Science (4,775)
    • Technology (5,361)
    • Television (5,037)
    • Uncategorized (1)
    • US News (5,415)
    About Us

    We are a creativity led international team with a digital soul. Our work is a custom built by the storytellers and strategists with a flair for exploiting the latest advancements in media and technology.

    Most of all, we stand behind our ideas and believe in creativity as the most powerful force in business.

    What makes us Different

    We care. We collaborate. We do great work. And we do it with a smile, because we’re pretty damn excited to do what we do. If you would like details on what else we can do visit out Contact page.

    Our Picks

    Babies start showing empathy even before they can speak

    May 18, 2025

    Inside The Hollywood Reporter’s ‘Die, My Love’ Cannes Premiere Party

    May 18, 2025

    ‘Pioneer Woman’ Ree Drummond’s Daughter Paige Is Married

    May 18, 2025
    © 2025 Beverly Hills Examiner. All rights reserved. All articles, images, product names, logos, and brands are property of their respective owners. All company, product and service names used in this website are for identification purposes only. Use of these names, logos, and brands does not imply endorsement unless specified. By using this site, you agree to the Terms & Conditions and Privacy Policy.

    Type above and press Enter to search. Press Esc to cancel.

    We use cookies on our website to give you the most relevant experience by remembering your preferences and repeat visits. By clicking “Accept All”, you consent to the use of ALL the cookies. However, you may visit "Cookie Settings" to provide a controlled consent.
    Cookie SettingsAccept All
    Manage consent

    Privacy Overview

    This website uses cookies to improve your experience while you navigate through the website. Out of these, the cookies that are categorized as necessary are stored on your browser as they are essential for the working of basic functionalities of the website. We also use third-party cookies that help us analyze and understand how you use this website. These cookies will be stored in your browser only with your consent. You also have the option to opt-out of these cookies. But opting out of some of these cookies may affect your browsing experience.
    Necessary
    Always Enabled
    Necessary cookies are absolutely essential for the website to function properly. These cookies ensure basic functionalities and security features of the website, anonymously.
    CookieDurationDescription
    cookielawinfo-checkbox-analytics11 monthsThis cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Analytics".
    cookielawinfo-checkbox-functional11 monthsThe cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Functional".
    cookielawinfo-checkbox-necessary11 monthsThis cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary".
    cookielawinfo-checkbox-others11 monthsThis cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Other.
    cookielawinfo-checkbox-performance11 monthsThis cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Performance".
    viewed_cookie_policy11 monthsThe cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.
    Functional
    Functional cookies help to perform certain functionalities like sharing the content of the website on social media platforms, collect feedbacks, and other third-party features.
    Performance
    Performance cookies are used to understand and analyze the key performance indexes of the website which helps in delivering a better user experience for the visitors.
    Analytics
    Analytical cookies are used to understand how visitors interact with the website. These cookies help provide information on metrics the number of visitors, bounce rate, traffic source, etc.
    Advertisement
    Advertisement cookies are used to provide visitors with relevant ads and marketing campaigns. These cookies track visitors across websites and collect information to provide customized ads.
    Others
    Other uncategorized cookies are those that are being analyzed and have not been classified into a category as yet.
    SAVE & ACCEPT