Detecting Text Ghostwritten by Large Language Models – The Berkeley Artificial Intelligence Research Blog


The structure of Ghostbuster, our new state-of-the-art method for detecting AI-generated text.

Large language models like ChatGPT write impressively well; so well, in fact, that they’ve become a problem. Students have begun using these models to ghostwrite assignments, leading some schools to ban ChatGPT. These models are also prone to producing text with factual errors, so wary readers may want to know whether generative AI tools have been used to ghostwrite news articles or other sources before trusting them.

What can teachers and consumers do? Existing tools to detect AI-generated text sometimes perform poorly on data that differs from what they were trained on. In addition, if these detectors falsely classify real human writing as AI-generated, they can jeopardize students whose genuine work is called into question.

Our recent paper introduces Ghostbuster, a state-of-the-art method for detecting AI-generated text. Ghostbuster works by finding the probability of generating each token in a document under several weaker language models, then combining functions based on these probabilities as input to a final classifier. Ghostbuster doesn’t need to know what model was used to generate a document, nor the probability of generating the document under that specific model. This property makes Ghostbuster particularly useful for detecting text potentially generated by an unknown model or a black-box model, such as the popular commercial models ChatGPT and Claude, for which probabilities aren’t available. We’re particularly interested in ensuring that Ghostbuster generalizes well, so we evaluated across a range of ways that text could be generated, including different domains (using newly collected datasets of essays, news, and stories), language models, or prompts.
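To make the pipeline described above concrete, here is a minimal sketch of its three stages: score every token under several weak language models, combine those per-token probabilities into document-level features, and feed the features to a final classifier. Everything in it is an assumption made for illustration rather than Ghostbuster's actual implementation: the two weak models are tiny add-one-smoothed unigram and bigram models, the combining functions (mean, min, max, and a mean difference between models) are picked by hand, the classifier is a plain logistic regression, and the corpus, documents, and labels are made-up toy data.

```python
import math
from collections import Counter
from itertools import combinations


def train_unigram(corpus):
    """Add-one-smoothed unigram model: P(token), ignoring the context."""
    counts = Counter(corpus)
    total, vocab = len(corpus), len(counts) + 1  # +1 slot for unseen tokens
    return lambda prev, tok: (counts[tok] + 1) / (total + vocab)


def train_bigram(corpus):
    """Add-one-smoothed bigram model: P(token | previous token)."""
    pairs = Counter(zip(corpus, corpus[1:]))
    prevs = Counter(corpus[:-1])
    vocab = len(set(corpus)) + 1
    return lambda prev, tok: (pairs[(prev, tok)] + 1) / (prevs[prev] + vocab)


def token_log_probs(model, tokens):
    """Log-probability of each token in the document under one weak model."""
    contexts = [None] + tokens[:-1]
    return [math.log(model(prev, tok)) for prev, tok in zip(contexts, tokens)]


def features(tokens, models):
    """Combine per-token log-probabilities into a fixed-length feature vector."""
    per_model = [token_log_probs(m, tokens) for m in models]
    feats = []
    for lp in per_model:  # simple per-model summaries
        feats += [sum(lp) / len(lp), min(lp), max(lp)]
    for a, b in combinations(per_model, 2):  # how much the models disagree
        feats.append(sum(x - y for x, y in zip(a, b)) / len(a))
    return feats


def sigmoid(z):
    """Numerically stable logistic function."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def predict_proba(w, b, x):
    """Probability that a feature vector belongs to the positive class."""
    return sigmoid(sum(wi * xi for wi, xi in zip(w, x)) + b)


def train_logistic(X, y, lr=0.05, epochs=200):
    """Final classifier: logistic regression fit by stochastic gradient descent."""
    w, b = [0.0] * len(X[0]), 0.0
    for _ in range(epochs):
        for x, label in zip(X, y):
            grad = predict_proba(w, b, x) - label
            w = [wi - lr * grad * xi for wi, xi in zip(w, x)]
            b -= lr * grad
    return w, b


# Toy reference corpus used to fit the weak models (made up for illustration).
reference = "the cat sat on the mat and the dog sat on the rug".split()
models = [train_unigram(reference), train_bigram(reference)]

# Toy labelled documents; the labels (1 = "AI", 0 = "human") are arbitrary.
docs = [
    ("the cat sat on the mat", 1),
    ("the dog sat on the rug", 1),
    ("rug cat the and mat on", 0),
    ("dog mat sat the the and", 0),
]
X = [features(text.split(), models) for text, _ in docs]
y = [label for _, label in docs]
w, b = train_logistic(X, y)

# Score a new document without knowing which model, if any, produced it.
score = predict_proba(w, b, features("the cat sat on the rug".split(), models))
print(f"estimated probability of the positive class: {score:.3f}")
```

Note that the scoring step never queries the model that may have written the document. It only needs probabilities from the weak models, which is the property that lets this style of detector handle black-box generators.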

