Google cousin develops technology to flag toxic online comments

Published Thu, Feb 23 20179:51 AM ESTUpdated Thu, Feb 23 201710:36 AM EST

Daisuke Wakabayashi

VIDEO5:0605:06

Using AI to combat cyber bullies

From self-driving cars to multi-language translation, machine learning is underpinning many of the technology industry's biggest advances with its form of artificial intelligence.

Now, Google's parent company, Alphabet, says it plans to apply machine learning technology to promote more civil discourse on the internet and make comment sections on sites a little less awful.

Read more from The New York Times:

Alphabet's profits stay predictably good in a volatile industry
A more money-conscious Alphabet emerges
Google to reorganize as Alphabet to keep its lead as an innovator

Jigsaw, a technology incubator within Alphabet, says it has developed a new tool for web publishers to identify toxic comments that can undermine a civil exchange of ideas. Starting Thursday, publishers can start applying for access to use Jigsaw's software, called Perspective, without charge.

"We have more information and more articles than any other time in history, and yet the toxicity of the conversations that follow those articles are driving people away from the conversation," said Jared Cohen, president of Jigsaw, formerly known as Google Ideas.

Jigsaw President Jared Cohen

Mike Pont | Getty Images

Discussion in comments sections often devolves into an offensive and hateful exchange unless it is carefully managed. This has prompted some publishers to turn off the comments section on articles because moderating them can be time-consuming and difficult.

With machine learning, a computer system is programmed to learn from repetition. It takes in training data — essentially, example after example — until it is familiar enough to anticipate with a high degree of confidence the proper response for a given situation.

In this instance, Jigsaw had a team review hundreds of thousands of comments to identify the types of comments that might deter people from a conversation. Based on that data, Perspective provided a score from zero to 100 on how similar the new comments are to the comments identified as toxic.

Jigsaw said it settled on the word toxic after finding that most reviewers shared views about what types of comments drive people away from a conversation. Opinions about what comments constituted, for example, a personal attack ranged widely.

The same methodology is being provided to publishers, who could use the scores to have human moderators review comments only for responses that registered above a certain number, or allow a reader to filter out comments above a certain level of toxicity.

Jigsaw worked with The New York Times and Wikipedia to develop Perspective. The Times's comments section is managed by a team of 14 moderators who manually review nearly every comment.

Because this requires considerable labor and time, The Times allows commenting on only about 10 percent of its articles. The Times said in a statement last year that it made its comments archive available to Jigsaw to help develop the machine-learning algorithm running Perspective. Linda Zebian, a spokeswoman for the Times, declined to comment on Wednesday.

Mr. Cohen said the technology was in its early stages and might flag some false positives, but he expected that it would become more accurate over time with access to a greater set of comments.

Jigsaw, whose stated mission is to use technology to tackle "geopolitical challenges" such as cybersecurity attacks and online censorship, said it also saw opportunities for its machine-learning software to identify comments that are off-topic or unsubstantial.