Work@Microsoft    Study@UW.edu    Live@Seattle

A few UW students hacked the Google Perspective API


Last Friday, Google and a company called Jigsaw launched a project called “Perspective” that uses machine learning to detect trolling on social media. The idea is to evaluate text posted on social media and estimate a “toxicity” score, which presumably can be used to identify abusive content and take it down.

On Saturday, Feb 25, a few grad students at the University of Washington found that they could use malformed inputs to fool the service, either lowering the toxicity score of clearly inappropriate content or raising the score of appropriate content. Here is their paper: Deceiving Google’s Perspective API Built for Detecting Toxic Comments.
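
To get a feel for why malformed input can lower a score, here is a minimal toy sketch in Python. It does not call the Perspective API and does not reproduce the students' method. The scorer `toy_toxicity_score`, the word list `FLAGGED_WORDS`, and the `perturb` helper are all hypothetical stand-ins, and inserting a dot inside a word is just one assumed example of a "malformed" input. The point is only that a scorer relying on recognizable words can lose track of a word once its spelling is broken, while a human reader still understands it.

```python
import re

# Hypothetical word list standing in for a learned model.
# This is NOT how Perspective works; it is a toy for illustration.
FLAGGED_WORDS = {"idiot", "stupid"}


def toy_toxicity_score(text: str) -> float:
    """Return the fraction of tokens that exactly match a flagged word."""
    tokens = re.findall(r"[a-z.]+", text.lower())
    if not tokens:
        return 0.0
    hits = sum(1 for token in tokens if token in FLAGGED_WORDS)
    return hits / len(tokens)


def perturb(text: str) -> str:
    """Insert a dot after the first letter of every flagged word."""
    def break_word(match: re.Match) -> str:
        word = match.group(0)
        if word.lower() in FLAGGED_WORDS:
            return word[0] + "." + word[1:]
        return word

    return re.sub(r"[A-Za-z]+", break_word, text)


original = "You are an idiot"
modified = perturb(original)

print(modified)                      # You are an i.diot
print(toy_toxicity_score(original))  # 0.25
print(toy_toxicity_score(modified))  # 0.0
```

The modified sentence is still perfectly readable to a person, but the toy scorer no longer sees any flagged word. A real ML model is far more sophisticated than this word lookup, so treat the sketch as an intuition aid rather than a description of the actual attack.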


ScottGe.net