{"id":6744,"date":"2025-08-13T09:06:00","date_gmt":"2025-08-13T09:06:00","guid":{"rendered":"https:\/\/sandbox.hbmadvisory.com\/amplify\/yomiuri-sues-perplexity-for-alleged-mass-scraping\/"},"modified":"2025-08-13T09:47:23","modified_gmt":"2025-08-13T09:47:23","slug":"yomiuri-sues-perplexity-for-alleged-mass-scraping","status":"publish","type":"post","link":"https:\/\/sandbox.hbmadvisory.com\/amplify\/yomiuri-sues-perplexity-for-alleged-mass-scraping\/","title":{"rendered":"Yomiuri sues Perplexity for alleged mass scraping"},"content":{"rendered":"<p><\/p>\n<div>\n<p>Japan\u2019s largest newspaper is suing Perplexity, accusing the AI search startup of copying more than 119,000 articles to train and power its chatbot. Yomiuri Shimbun Holdings said it filed the claim on August 7 in Tokyo District Court, seeking about \u00a52.17bn ($15m) in damages and an injunction to stop Perplexity from reproducing or distributing its content.<\/p>\n<p>The suit, brought by three Yomiuri corporate entities in Tokyo, Osaka and Fukuoka, alleges that Perplexity servers accessed the paper\u2019s site between February and June to harvest articles later used in responses to user queries. The company argues this infringes reproduction and public-transmission rights under Japanese copyright law. <\/p>\n<p>Jiji Press, via Nippon.com, reported the paper is also concerned that \u201czero-click\u201d answers reduce site visits and threaten the business model that supports its journalism.<\/p>\n<p>Under a 2018 amendment to Japan\u2019s Copyright Act, reproductions for machine learning and data analysis can be made without prior authorisation, but only if they do not \u201cunreasonably prejudice\u201d rights holders. Legal analysts say the Yomiuri case will be an early test of how far that exception extends.<\/p>\n<p>The action is also tied to web-security claims. Cloudflare has alleged that Perplexity used undeclared crawlers which masked their identity and evaded robots.txt and firewall rules. The company said it removed Perplexity from its verified bot list and updated security rules, with its findings likely to feature in court if Yomiuri pursues that argument.<\/p>\n<p>Perplexity has grown rapidly in the AI search market, with Bloomberg reporting its valuation reached about $9bn in late 2024. It has millions of users and runs subscription and revenue-sharing programmes with publishers.<\/p>\n<p>The lawsuit is the first brought by a major Japanese publisher against an AI company. Its outcome could influence how the 2018 copyright exceptions are interpreted, and whether courts demand clearer consent or compensation mechanisms for large-scale crawling that damages news organisations\u2019 commercial interests.<\/p>\n<p>Source: <a href=\"https:\/\/www.noahwire.com\" rel=\"nofollow noopener\" target=\"_blank\">Noah Wire Services<\/a><\/p>\n<\/p><\/div>\n<div>\n<h3 class=\"mt-0\">Noah Fact Check Pro<\/h3>\n<p class=\"text-sm\">The draft above was created using the information available at the time the story first<br \/>\n        emerged. We\u2019ve since applied our fact-checking process to the final narrative, based on the criteria listed<br \/>\n        below. The results are intended to help you assess the credibility of the piece and highlight any areas that may<br \/>\n        warrant further investigation.<\/p>\n<h3 class=\"mt-3 mb-1 font-semibold text-base\">Freshness check<\/h3>\n<p class=\"text-sm pt-0\"><span class=\"font-bold\">Score:<br \/>\n        <\/span>10<\/p>\n<p class=\"text-sm pt-0\"><span class=\"font-bold\">Notes:<br \/>\n        <\/span>The narrative is fresh, with the earliest known publication date being August 8, 2025. No earlier versions with different figures, dates, or quotes were found. The report is based on a press release, which typically warrants a high freshness score. No discrepancies or recycled content were identified.<\/p>\n<h3 class=\"mt-3 mb-1 font-semibold text-base\">Quotes check<\/h3>\n<p class=\"text-sm pt-0\"><span class=\"font-bold\">Score:<br \/>\n        <\/span>10<\/p>\n<p class=\"text-sm pt-0\"><span class=\"font-bold\">Notes:<br \/>\n        <\/span>No direct quotes were identified in the provided text. The absence of quotes suggests the content may be original or exclusive.<\/p>\n<h3 class=\"mt-3 mb-1 font-semibold text-base\">Source reliability<\/h3>\n<p class=\"text-sm pt-0\"><span class=\"font-bold\">Score:<br \/>\n        <\/span>8<\/p>\n<p class=\"text-sm pt-0\"><span class=\"font-bold\">Notes:<br \/>\n        <\/span>The narrative originates from Svaboda.org, a reputable news outlet. However, the article is in Belarusian, which may limit accessibility for some readers.<\/p>\n<h3 class=\"mt-3 mb-1 font-semibold text-base\">Plausability check<\/h3>\n<p class=\"text-sm pt-0\"><span class=\"font-bold\">Score:<br \/>\n        <\/span>10<\/p>\n<p class=\"text-sm pt-0\"><span class=\"font-bold\">Notes:<br \/>\n        <\/span>The claims made in the narrative are plausible and align with reports from other reputable sources, such as The Asahi Shimbun and Nippon.com. The language and tone are consistent with typical news reporting.<\/p>\n<h3 class=\"mt-3 mb-1 font-semibold text-base\">Overall assessment<\/h3>\n<p class=\"text-sm pt-0\"><span class=\"font-bold\">Verdict<\/span> (FAIL, OPEN, PASS): <span class=\"font-bold\">PASS<\/span><\/p>\n<p class=\"text-sm pt-0\"><span class=\"font-bold\">Confidence<\/span> (LOW, MEDIUM, HIGH): <span class=\"font-bold\">HIGH<\/span><\/p>\n<p class=\"text-sm mb-3 pt-0\"><span class=\"font-bold\">Summary:<br \/>\n        <\/span>The narrative is fresh, original, and sourced from a reputable outlet. The claims are plausible and consistent with other reports. No significant issues were identified, leading to a high confidence in the assessment.<\/p>\n<\/div>\n","protected":false},"excerpt":{"rendered":"<p>Japan\u2019s largest newspaper is suing Perplexity, accusing the AI search startup of copying more than 119,000 articles to train and power its chatbot. Yomiuri Shimbun Holdings said it filed the claim on August 7 in Tokyo District Court, seeking about \u00a52.17bn ($15m) in damages and an injunction to stop Perplexity from reproducing or distributing its<\/p>\n","protected":false},"author":1,"featured_media":6745,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[118],"tags":[],"class_list":{"0":"post-6744","1":"post","2":"type-post","3":"status-publish","4":"format-standard","5":"has-post-thumbnail","7":"category-publishing-news"},"amp_enabled":true,"_links":{"self":[{"href":"https:\/\/sandbox.hbmadvisory.com\/amplify\/wp-json\/wp\/v2\/posts\/6744","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/sandbox.hbmadvisory.com\/amplify\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/sandbox.hbmadvisory.com\/amplify\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/sandbox.hbmadvisory.com\/amplify\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/sandbox.hbmadvisory.com\/amplify\/wp-json\/wp\/v2\/comments?post=6744"}],"version-history":[{"count":1,"href":"https:\/\/sandbox.hbmadvisory.com\/amplify\/wp-json\/wp\/v2\/posts\/6744\/revisions"}],"predecessor-version":[{"id":6746,"href":"https:\/\/sandbox.hbmadvisory.com\/amplify\/wp-json\/wp\/v2\/posts\/6744\/revisions\/6746"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/sandbox.hbmadvisory.com\/amplify\/wp-json\/wp\/v2\/media\/6745"}],"wp:attachment":[{"href":"https:\/\/sandbox.hbmadvisory.com\/amplify\/wp-json\/wp\/v2\/media?parent=6744"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/sandbox.hbmadvisory.com\/amplify\/wp-json\/wp\/v2\/categories?post=6744"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/sandbox.hbmadvisory.com\/amplify\/wp-json\/wp\/v2\/tags?post=6744"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}