Close Menu
  • Home
  • AI Models
    • DeepSeek
    • xAI
    • OpenAI
    • Meta AI Llama
    • Google DeepMind
    • Amazon AWS AI
    • Microsoft AI
    • Anthropic (Claude)
    • NVIDIA AI
    • IBM WatsonX Granite 3.1
    • Adobe Sensi
    • Hugging Face
    • Alibaba Cloud (Qwen)
    • Baidu (ERNIE)
    • C3 AI
    • DataRobot
    • Mistral AI
    • Moonshot AI (Kimi)
    • Google Gemma
    • xAI
    • Stability AI
    • H20.ai
  • AI Research
    • Allen Institue for AI
    • arXiv AI
    • Berkeley AI Research
    • CMU AI
    • Google Research
    • Microsoft Research
    • Meta AI Research
    • OpenAI Research
    • Stanford HAI
    • MIT CSAIL
    • Harvard AI
  • AI Funding & Startups
    • AI Funding Database
    • CBInsights AI
    • Crunchbase AI
    • Data Robot Blog
    • TechCrunch AI
    • VentureBeat AI
    • The Information AI
    • Sifted AI
    • WIRED AI
    • Fortune AI
    • PitchBook
    • TechRepublic
    • SiliconANGLE – Big Data
    • MIT News
    • Data Robot Blog
  • Expert Insights & Videos
    • Google DeepMind
    • Lex Fridman
    • Matt Wolfe AI
    • Yannic Kilcher
    • Two Minute Papers
    • AI Explained
    • TheAIEdge
    • Matt Wolfe AI
    • The TechLead
    • Andrew Ng
    • OpenAI
  • Expert Blogs
    • François Chollet
    • Gary Marcus
    • IBM
    • Jack Clark
    • Jeremy Howard
    • Melanie Mitchell
    • Andrew Ng
    • Andrej Karpathy
    • Sebastian Ruder
    • Rachel Thomas
    • IBM
  • AI Policy & Ethics
    • ACLU AI
    • AI Now Institute
    • Center for AI Safety
    • EFF AI
    • European Commission AI
    • Partnership on AI
    • Stanford HAI Policy
    • Mozilla Foundation AI
    • Future of Life Institute
    • Center for AI Safety
    • World Economic Forum AI
  • AI Tools & Product Releases
    • AI Assistants
    • AI for Recruitment
    • AI Search
    • Coding Assistants
    • Customer Service AI
    • Image Generation
    • Video Generation
    • Writing Tools
    • AI for Recruitment
    • Voice/Audio Generation
  • Industry Applications
    • Finance AI
    • Healthcare AI
    • Legal AI
    • Manufacturing AI
    • Media & Entertainment
    • Transportation AI
    • Education AI
    • Retail AI
    • Agriculture AI
    • Energy AI
  • AI Art & Entertainment
    • AI Art News Blog
    • Artvy Blog » AI Art Blog
    • Weird Wonderful AI Art Blog
    • The Chainsaw » AI Art
    • Artvy Blog » AI Art Blog
What's Hot

Juro + Wordsmith Form MCP-Based AI Partnership – Artificial Lawyer

DeepScholar-Bench: A Live Benchmark and Automated Evaluation for Generative Research Synthesis – Takara TLDR

Nvidia AI chips sales rise but so do fears of an AI bubble bursting

Facebook X (Twitter) Instagram
Advanced AI News
  • Home
  • AI Models
    • OpenAI (GPT-4 / GPT-4o)
    • Anthropic (Claude 3)
    • Google DeepMind (Gemini)
    • Meta (LLaMA)
    • Cohere (Command R)
    • Amazon (Titan)
    • IBM (Watsonx)
    • Inflection AI (Pi)
  • AI Research
    • Allen Institue for AI
    • arXiv AI
    • Berkeley AI Research
    • CMU AI
    • Google Research
    • Meta AI Research
    • Microsoft Research
    • OpenAI Research
    • Stanford HAI
    • MIT CSAIL
    • Harvard AI
  • AI Funding
    • AI Funding Database
    • CBInsights AI
    • Crunchbase AI
    • Data Robot Blog
    • TechCrunch AI
    • VentureBeat AI
    • The Information AI
    • Sifted AI
    • WIRED AI
    • Fortune AI
    • PitchBook
    • TechRepublic
    • SiliconANGLE – Big Data
    • MIT News
    • Data Robot Blog
  • AI Experts
    • Google DeepMind
    • Lex Fridman
    • Meta AI Llama
    • Yannic Kilcher
    • Two Minute Papers
    • AI Explained
    • TheAIEdge
    • The TechLead
    • Matt Wolfe AI
    • Andrew Ng
    • OpenAI
    • Expert Blogs
      • François Chollet
      • Gary Marcus
      • IBM
      • Jack Clark
      • Jeremy Howard
      • Melanie Mitchell
      • Andrew Ng
      • Andrej Karpathy
      • Sebastian Ruder
      • Rachel Thomas
      • IBM
  • AI Tools
    • AI Assistants
    • AI for Recruitment
    • AI Search
    • Coding Assistants
    • Customer Service AI
  • AI Policy
    • ACLU AI
    • AI Now Institute
    • Center for AI Safety
  • Business AI
    • Advanced AI News Features
    • Finance AI
    • Healthcare AI
    • Education AI
    • Energy AI
    • Legal AI
LinkedIn Instagram YouTube Threads X (Twitter)
Advanced AI News
Perplexity AI

Perplexity gives Apple new reason not to acquire the AI company

By Advanced AI EditorAugust 4, 2025No Comments5 Mins Read
Share Facebook Twitter Pinterest Copy Link Telegram LinkedIn Tumblr Email
Share
Facebook Twitter LinkedIn Pinterest Email


Perplexity native Mac app

Perplexity has long been accused of deliberately bypassing anti-scraping measures to retrieve web content. While the company has historically dismissed these accusations as disingenuous or misunderstandings, a new report shows that not only is the practice still happening, but it may actually be getting worse.

Perplexity’s main counter-argument: semantics

The issue with Perplexity’s web crawling practices first came to light in June 2024, when Wired and other media outlets accused the company of ignoring the Robots Exclusion Protocol, and pulling content from their websites.

At the time, Perplexity CEO Aravind Srinivas said the culprit was an unnamed third-party web crawling vendor, and that there was “a basic misunderstanding of the way this works.”

It wasn’t long before other publications started accusing Perplexity of plagiarism and unethical web scraping, with The New York Times and the BBC even issuing legal threats. At the time, Perplexity said the BBC was being “manipulative and opportunistic”, and had a “fundamental misunderstanding of technology, the internet and intellectual property law”.

Since then, Perplexity has repeatedly denied this line of accusation, disputing the definition of crawling and scraping in specific use cases. As Wired reported:

In other words, if a user manually provides a URL to an AI, Perplexity says its AI isn’t acting as a web crawler but rather a tool to assist the user in retrieving and processing information they requested. But to Wired and many other publishers, that’s a distinction without a difference because visiting a URL and pulling the information from it to summarize the text sure looks a whole lot like scraping if it’s done thousands of times a day.

Likewise, Srinivas has promised in the past that the company would make it easier to go to the original source of the content surfaced by their answer engine. However, this does not address the fact that the problem is in the sourcing of information, rather than just how it’s presented.

Cloudflare says Perplexity is going out of its way to go after data it is explicitly being told not to crawl

Today, Cloudflare published a report that claims that even when a server specifically denies all automated access, and includes specific rules that block crawling from Perplexity’s public crawlers, Perplexity reportedly does it anyway.

According to Cloudflare:

“We observed that Perplexity uses not only their declared user-agent, but also a generic browser intended to impersonate Google Chrome on macOS when their declared crawler was blocked. Both their declared and undeclared crawlers were attempting to access the content for scraping contrary to the web crawling norms as outlined in RFC 9309. This undeclared crawler utilized multiple IPs not listed in Perplexity’s official IP range, and would rotate through these IPs in response to the restrictive robots.txt policy and block from Cloudflare. In addition to rotating IPs, we observed requests coming from different ASNs in attempts to further evade website blocks. This activity was observed across tens of thousands of domains and millions of requests per day. We were able to fingerprint this crawler using a combination of machine learning and network signals.”

In a statement to The Verge, Perplexity called the blog post a “publicity stunt”, and said that “there are a lot of misunderstandings in the blog post.”

To be fair, the accusation of unduly scraping or pulling web content to present it as part of an AI-generated answer is definitely not exclusive to Perplexity. In the past, OpenAI’s crawling practices were likened to DDoS attacks. The same goes for Anthropic.

It’s also worth remembering that the Robots Exclusion Protocol isn’t a law, but rather a widely followed convention. Still, Cloudflare’s investigation specifically called out Perplexity, which also happens to be the company reportedly under Apple’s consideration for an acquisition. So here we are.

Does Apple really need this headache?

There is absolutely nothing stopping Apple from acquiring Perplexity. In fact, I currently believe that it is more likely that Apple will acquire it, than not. To be perfectly honest, I’m half-expecting the announcement to come out before I’m done writing this piece.

And Apple should buy a company like Perplexity.

But given Apple’s stance on privacy and on doing what is right, should it really acquire a company with such a loaded background and, frankly, attitude?

It is perfectly possible that Apple may believe that under its culture, under its leadership, and under its ethical web crawling practices, it may be able to render the inbound tech free of the supposed sins of the past. But this wouldn’t erase the fact that Perplexity only got to where it got because it did what it reportedly did.

Of course, if Apple decides to acquire Perplexity, that will (hopefully) mean that the company did its due diligence, and didn’t find anything legally compromising.

But it might also mean Apple feels pressured enough to compromise, however slightly, on its core principles to catch up. And if that turns out to be the case, it would be more disappointing than its current lag in AI.

AirPods deals on Amazon

FTC: We use income earning auto affiliate links. More.



Source link

Follow on Google News Follow on Flipboard
Share. Facebook Twitter Pinterest LinkedIn Tumblr Email Copy Link
Previous ArticleHow Handmade.com modernizes product image and description handling with Amazon Bedrock and Amazon OpenSearch Service
Next Article Building AI to meet customer expectations | EY
Advanced AI Editor
  • Website

Related Posts

Perplexity, the $18 billion AI ‘answer machine,’ wants to play nice with news publishers. They keep suing it anyway

August 28, 2025

XRP, ADA, SOL: Price Predictions by Perplexity AI

August 27, 2025

Financial Times owner Nikkei sues Perplexity AI over copyright infringement claims

August 27, 2025

Comments are closed.

Latest Posts

Artifacts From 2,000-Year-old Sunken City Lifted Out of the Sea

Fita Threatens Legal Action for Uni’s Trans-Inclusive Museum Guidance

Claire Oliver Gallery Expands in New York’s Harlem Neighborhood

Van Gogh Museum Threatens Dutch Government with Closure

Latest Posts

Juro + Wordsmith Form MCP-Based AI Partnership – Artificial Lawyer

August 28, 2025

DeepScholar-Bench: A Live Benchmark and Automated Evaluation for Generative Research Synthesis – Takara TLDR

August 28, 2025

Nvidia AI chips sales rise but so do fears of an AI bubble bursting

August 28, 2025

Subscribe to News

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

Recent Posts

  • Juro + Wordsmith Form MCP-Based AI Partnership – Artificial Lawyer
  • DeepScholar-Bench: A Live Benchmark and Automated Evaluation for Generative Research Synthesis – Takara TLDR
  • Nvidia AI chips sales rise but so do fears of an AI bubble bursting
  • Google’s AI Weather Model Nailed Its First Major Storm Forecast
  • All 100 AI unicorns since ChatGPT launched

Recent Comments

  1. LhaneUnecy on Ballet Tech Forms The Future Through Dance
  2. OLaneUnecy on Marc Raibert: Boston Dynamics and the Future of Robotics | Lex Fridman Podcast #412
  3. Fobertsig on Study: AI-Powered Research Prowess Now Outstrips Human Experts, Raising Bioweapon Risks
  4. 다낭 유흥 on 1-800-CHAT-GPT—12 Days of OpenAI: Day 10
  5. toto togel on 1-800-CHAT-GPT—12 Days of OpenAI: Day 10

Welcome to Advanced AI News—your ultimate destination for the latest advancements, insights, and breakthroughs in artificial intelligence.

At Advanced AI News, we are passionate about keeping you informed on the cutting edge of AI technology, from groundbreaking research to emerging startups, expert insights, and real-world applications. Our mission is to deliver high-quality, up-to-date, and insightful content that empowers AI enthusiasts, professionals, and businesses to stay ahead in this fast-evolving field.

Subscribe to Updates

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

LinkedIn Instagram YouTube Threads X (Twitter)
  • Home
  • About Us
  • Advertise With Us
  • Contact Us
  • DMCA
  • Privacy Policy
  • Terms & Conditions
© 2025 advancedainews. Designed by advancedainews.

Type above and press Enter to search. Press Esc to cancel.