Google open-sourced its watermarking tool for AI-generated text

1 month ago 21

Google’s SynthID substance watermarking technology, a instrumentality the institution created to marque AI-generated substance easier to identify, is present disposable open-source done the Google Responsible Generative AI Toolkit, the institution announced connected X.

“Now, different [generative] AI developers volition beryllium capable to usage this exertion to assistance them observe whether substance outputs person travel from their ain [large connection models], making it easier for much developers to physique AI responsibly,” Pushmeet Kohli, the vice president of probe astatine Google DeepMind, told MIT Technology Review.

Watermarks person go progressively important tools arsenic ample connection models are utilized to dispersed political misinformation, generate nonconsensual intersexual content, and for different malicious purposes. California’s already looking into making AI watermarking mandatory, portion China’s government started requiring it past year. Yet the tools are inactive a enactment successful progress.

SynthID, which was announced past August, helps marque AI-generated output detectable by adding an invisible watermark into images, audio, video, and substance arsenic they’re generated. Google says the substance mentation of SynthID works by making the substance output somewhat little probable successful a mode that is detectable by bundle but not humans:

An LLM generates substance 1 token astatine a time. These tokens tin correspond a azygous character, connection oregon portion of a phrase. To make a series of coherent text, the exemplary predicts the adjacent astir apt token to generate. These predictions are based connected the preceding words and the probability scores assigned to each imaginable token.

For example, with the operation “My favourite tropical fruits are __.” The LLM mightiness commencement completing the condemnation with the tokens “mango,” “lychee,” “papaya,” oregon “durian,” and each token is fixed a probability score. When there’s a scope of antithetic tokens to take from, SynthID tin set the probability people of each predicted token, successful cases wherever it won’t compromise the quality, accuracy and creativity of the output.

This process is repeated passim the generated text, truthful a azygous condemnation mightiness incorporate 10 oregon much adjusted probability scores, and a leafage could incorporate hundreds. The last signifier of scores for some the model’s connection choices combined with the adjusted probability scores are considered the watermark.

Google claims the system, which it’s already integrated into its Gemini chatbot, doesn’t compromise the quality, accuracy, creativity, oregon velocity of generated text, which has agelong been an contented with watermarking systems. Google says it tin enactment connected substance arsenic abbreviated arsenic 3 sentences, arsenic good as text that’s been cropped, paraphrased, oregon modified. But it struggles with abbreviated text, contented that’s been rewritten oregon translated, and adjacent responses to factual questions.

“SynthID isn’t a metallic slug for identifying AI generated content,” Google wrote successful a blog station successful May. “[But it] is an important gathering artifact for processing much reliable AI recognition tools and tin assistance millions of radical marque informed decisions astir however they interact with AI-generated content.”

Read Entire Article