Promptwatch Logo

AI Bots & Web Crawlers Directory

Know exactly which AI crawlers, search engine bots, and automated agents reach your site: who runs them, what they do with your content, and how to allow or block each one.

223 bots

AdagioBot

Adagiobot is a web crawler that analyzes websites for advertising demand optimization, helping publishers maximize revenue through real-time bidding analysis and performance insights.

Advertising

AdIdxBot

AdIdxBot is the crawler used by Bing Ads for quality control of ads and their destination websites. It has multiple user agent variants including desktop, iPhone, and Windows Phone versions.

Search Engine Crawler

Adsense

The AdSense crawler visits participating sites in order to provide them with relevant ads.

Search Engine Crawler

adsnaver

Naver's ad crawler that periodically visits registered ad landing pages to collect on-page content for effective ad matching and ranking. It ignores robots.txt for URLs registered in the ad system.

Search Engine Crawler

Adyen Webhook

Adyen’s webhooks (Notification API) send encrypted, real-time HTTP callbacks for key payment and account events, automating order fulfillment, settlement reconciliation, and risk-management workflows.

Webhook

AhrefsBot

Powers the database for both Ahrefs, a marketing intelligence platform, and Yep, an independent, privacy-focused search engine.

SEO

AhrefsSiteAudit

Powers Ahrefs’ Site Audit tool. Ahrefs users can use Site Audit to analyze websites and find both technical SEO and on-page SEO issues.

SEO

AI2Bot

AI2Bot is operated by the Allen Institute for Artificial Intelligence (Ai2) to crawl the web for content to train open-source AI models.

AI CrawlerUnverifiable

aiHitBot

aiHitBot collects and maintains historical information about companies.

AI CrawlerUnverifiable

Algolia

The Algolia Crawler extracts content from your site and makes it searchable.

Search Engine Crawler

Amazon AdBot

Amazon AdBot is a crawler used by different advertising services at Amazon to determine a website's content in order to provide relevant and appropriate advertising.

Search Engine Crawler

Amazon Bedrock Bot

Amazon Bedrock Bot fetches web pages that customers add as data sources to Amazon Bedrock knowledge bases, so Bedrock-powered assistants can answer with that content.

AI Crawler

Amazon Kendra

Amazon Kendra is a managed information retrieval and intelligent search service that uses natural language processing and advanced deep learning model.

AI Assistant

Amazon Product Discovery

Amazon's web crawler used to collect publicly available product details from Amazon Selling Partner websites to help improve the accuracy and completeness of product information on Amazon.

Search Engine CrawlerUnverifiable

Amazon Q

Amazon Q Business is a generative artificial intelligence (generative AI)-powered assistant that you can tailor to your business needs.

AI Assistant

Amazon Route 53 Health Check Service

Amazon Route 53 Health Check Service

Monitoring

Amazon Seller Initiated Listing

Amazon's web crawler that helps sellers succeed by giving them the option to provide a URL to a website and create high-quality product pages in Amazon's store.

E-commerceUnverifiable

Amazonbot

Amazonbot is Amazon's web crawler used to improve our services, such as enabling Alexa to more accurately answer questions for customers.

AI Crawler

Andibot

Andibot gathers web content for Andi, a conversational AI search assistant that answers questions with summaries and sources.

AI AssistantUnverifiable

Anthropic AI

Anthropic AI is a legacy Anthropic crawler that collected broad web data for Claude model development.

AI CrawlerTracked by Promptwatch

APIs-Google

Crawling preferences addressed to the APIs-Google user agent affect the delivery of push notification messages by Google APIs.

Search Engine Crawler

Apple Podcasts

Apple Podcasts crawler that only accesses URLs associated with registered content on Apple Podcasts. Does not follow robots.txt.

Feed Fetcher

Applebot

Applebot powers search features in Apple's ecosystem (Spotlight, Siri, Safari) and may be used to train Apple's foundation models for generative AI features.

AI Crawler

Applebot-Extended

Applebot-Extended is a control token that lets site owners opt out of having content crawled by Applebot used to train Apple's foundation models and Apple Intelligence features.

AI Training

Artemis Web Crawler

Artemis is a calm web reader with which you can follow websites and blogs.

Feed Fetcher

Atlassian Jira Webhooks

Delivers webhook notifications from Jira Cloud when issues, projects, or other resources change.

Webhook

Atlassian Rovo

Crawls and indexes web content for Atlassian Rovo's AI-powered search, chat, and agents.

AI Crawler

Awario Bot

Awario's web crawler used to discover and collect new and updated web data for their social media monitoring and brand mention tracking platform.

MonitoringUnverifiable

Awario RSS Bot

One of Awario's primary web crawlers specialized in collecting RSS feed data.

Feed FetcherUnverifiable

Awario Smart Bot

One of Awario's primary web crawlers that discovers and collects new and updated web data.

AnalyticsUnverifiable

BaiduSpider

Baiduspider is Baidu’s web crawler that indexes websites for inclusion in its Chinese-market search results.

Search Engine Crawler

Barkrowler

Barkrowler is Babbar's web crawler that fuels and updates their graph representation of the web, providing SEO tools for the marketing community.

SEO

Better Stack

Better Stack is a platform for monitoring and alerting on your applications.

Monitoring

Bingbot

Bingbot is Microsoft's web crawler used for indexing websites for Bing Search.

Search Engine Crawler

BraveBot

BraveBot crawls and indexes web pages for the Brave Search index, which also grounds answers from Brave's Leo AI assistant.

AI Assistant

Brightbot

Brightbot is Bright Data's crawler layer that monitors the health of websites and enforces ethical web data collection.

Analytics

Browserbase

Runs headless browser automation on behalf of Browserbase customers for web scraping, form submission, and testing.

AI Crawler

Buffer Link Preview Bot

Helps Buffer users create better social media posts by generating rich previews when they share links

Preview

Bytespider

Bytespider is ByteDance's web crawler used to gather training data for their AI large language models.

AI CrawlerUnverifiable

CCBot

CCBot is operated by the Common Crawl Foundation to crawl web content for AI training and research.

AI Crawler

CensysInspectBot

Censys Inspect is a web crawler operated by Censys that performs internet-wide scanning to discover, monitor, and analyze publicly accessible devices and services.

AnalyticsUnverifiable

Channel3Bot

Crawls product detail pages to index content for AI-powered product discovery, routing shoppers to original websites.

AI Crawler

ChatGPT-Operator

Handles user-initiated requests from ChatGPT operator accessing external content; not used for automated crawling or AI training.

AI Assistant

ChatGPT-User

Handles user-initiated requests in ChatGPT, accessing external content to provide real-time information; not used for automated crawling or AI training.

AI AssistantTracked by Promptwatch

Checkly

Checkly is a platform for monitoring and alerting on your applications.

Monitoring

Chrome Lighthouse

PageSpeed Insights (PSI) reports on the user experience of a page on both mobile and desktop devices, and provides suggestions on how that page may be improved.

Analytics

Chrome Privacy Preserving Prefetch Proxy

Chrome's Privacy Preserving Prefetch Proxy service that fetches /.well-known/traffic-advice to enable privacy-preserving prefetch hints.

Preview

ClarityBot

ClarityBot is seoClarity's web crawler that performs technical SEO audits, analyzes content, and monitors website performance.

SEOUnverifiable

Claude Web

Claude Web is a legacy Anthropic crawler that fetched recent web content for the Claude assistant. Its behavior has largely been folded into ClaudeBot and Claude-User.

AI CrawlerTracked by Promptwatch

Claude-SearchBot

Claude-SearchBot navigates the web to improve search result quality for users. It analyzes online content specifically to enhance the relevance and accuracy of search responses.

AI AssistantTracked by Promptwatch

Claude-User

Claude-User supports Claude AI users. When individuals ask questions to Claude, it may access websites using a Claude-User agent.

AI AssistantTracked by Promptwatch

ClaudeBot

ClaudeBot helps enhance the utility and safety of our generative AI models by collecting web content that could potentially contribute to their training.

AI CrawlerTracked by Promptwatch

Cohere AI

Cohere AI collects publicly available web text that helps train and refine Cohere's large language models for enterprise generative AI.

AI CrawlerTracked by Promptwatch

ContentKingBot

ContentKing (now Conductor Website Monitoring) is a website monitoring tool that continuously audits websites to help improve their performance and visibility.

AnalyticsUnverifiable

Cookiebot

Cookiebot automates compliance with cookie laws and helps you manage your cookie consent preferences.

Monitoring

CookieScript

A cookie scanning bot that examines websites for cookie usage to help maintain GDPR and other privacy regulation compliance.

MonitoringUnverifiable

Cotoyogi

Cotoyogi is a web crawler operated by the Center for Research and Development on Data Lake, ROIS-DS (Research Organization of Information and Systems - Data Science) for collecting Japanese language...

AI CrawlerUnverifiable

Coveobot

Coveobot is a crawler operated by Coveo that indexes content for enterprise search, recommendations, and generative experience platforms.

AI AssistantUnverifiable

CriteoBot

CriteoBot is a crawler operated by Criteo that analyzes web content to serve relevant contextual ads.

Advertising

Customer.io webhooks

Customer.io's webhook service for event-driven marketing automation and customer data platform.

Webhook

Cybaa Agent

Performs user-initiated security checks on behalf of Cybaa customers, validating security headers, TLS/SSL configuration, and other domain-specific security controls to ensure website compliance and...

Monitoring

Dash0 Synthetic Monitoring

Dash0's Synthetic Monitoring provides proactive, automated insights into the availability and performance of your websites and APIs.

Monitoring

Datadog Synthetic Monitoring Robot

Datadog's automated monitoring service that performs synthetic tests to verify website availability and performance.

Monitoring

DataForSeoBot

DataForSeoBot is a backlink checker bot operated by DataForSEO that crawls websites to build and maintain their backlink database.

SEO

DeepSeek Bot

DeepSeek Bot crawls web content used to train and improve DeepSeek's generative AI models.

AI CrawlerTracked by Promptwatch

Detectify

Detectify is a web security scanner that performs automated security tests on web applications and attack surface monitoring.

Monitoring

Diffbot

Diffbot crawls and structures web pages into a knowledge graph that is sold for AI training, retrieval, and data enrichment.

AI Crawler

DigitalOceanUptimeBot

DigitalOcean Uptime is a monitoring service that checks the health of any URL or IP address.

MonitoringUnverifiable

Discord Bot

Discord's link preview bot that crawls URLs shared in Discord chats to generate rich previews.

PreviewUnverifiable

DotBot

DotBot is a web crawler operated by Moz (formerly SEOmoz) that collects data for their Link Explorer tool and Links API.

SEOUnverifiable

DuckAssistBot

DuckAssistBot is a web crawler for DuckDuckGo Search that crawls pages in real-time for AI-assisted answers, which prominently cite their sources. This data is not used in any way to train AI models.

AI Assistant

DuckDuckBot

DuckDuckBot is a web crawler for DuckDuckGo. DuckDuckBot’s job is to constantly improve search results and offer users the best and most secure search experience possible.

Search Engine Crawler

Facebook Webhooks

Facebook's webhook service that delivers real-time event notifications for Meta platform events and changes.

Webhook

FacebookBot

FacebookBot crawls public web content that Meta may use to improve language models and other AI products. It is distinct from the link-preview fetcher facebookexternalhit.

AI Crawler

FacebookExternalHit

Fetches content for shared links on Meta platforms to generate rich previews.

Preview

FalBot

fal.ai's webhook service that delivers asynchronous notifications for AI model processing and generation tasks.

Webhook

FlipboardProxy

Fetches and prepares website content for presentation in the Flipboard application.

Feed FetcherUnverifiable

GeedoProductSearchBot

GeedoProductSearch is a web crawler operated by Geedo SIA that indexes product information from e-commerce websites.

E-commerce

Gemini Deep Research

Gemini Deep Research is Google's AI-powered research tool that performs comprehensive multi-step research on complex topics, analyzing web content to provide detailed insights and answers.

AI Assistant

GitHub Camo

GitHub's image proxy service

Preview

GitHub Hookshot

GitHub's webhooks for events like push, pull request, etc.

Webhook

Google AdMob Reward Verification

Sends server-side verification callbacks to confirm users completed rewarded ad views.

Advertising

Google Ads Creatives Assistant

Fetches website content for Google Ads creative generation and enhancement tools.

AI Assistant

Google AdsBot

Google AdsBot is Google's web crawler for quality control of Google Ads.

Search Engine Crawler

Google Association Service

Verifies associations between apps and websites for Digital Asset Links.

Verification

Google Business Link Verification

Verifies that business links in Google Business Profile are accessible and return valid HTTP status codes.

Verification

Google Docs

Fetches images and page content when users insert links into Google Docs.

Preview

Google Feedfetcher

Feedfetcher is used for crawling RSS or Atom feeds for Google News and PubSubHubbub.

Feed Fetcher

Google Image Proxy

Google's image caching proxy service used by Gmail and other Google services to cache and serve images.

Preview

Google NotebookLM

Google NotebookLM fetches web sources that a user adds to a notebook so the assistant can summarize, answer questions, and cite them. Because fetches are user-initiated, it may bypass robots.txt.

AI Assistant

Google PageRenderer

Upon user request, Google Page Renderer fetches and renders web pages.

Preview

Google Publisher Center

Google Publisher Center fetches and processes feeds that publishers explicitly supplied for use in Google News landing pages.

Feed Fetcher

Google Read Aloud

Upon user request, Google Read Aloud fetches and reads out web pages using text-to-speech (TTS).

User Initiated

Google Site Verifier

Google Site Verifier fetches Search Console verification tokens.

Verification

Google StoreBot

Crawling preferences addressed to the Storebot-Google user agent affect all surfaces of Google Shopping (for example, the Shopping tab in Google Search and Google Shopping).

Search Engine Crawler

Google-Adwords-Instant

Fetches advertiser landing pages when triggered by user actions in the Google Ads platform.

Advertising

Google-Agent

Google-Agent navigates the web and performs actions upon user request, used by agents hosted on Google infrastructure such as Project Mariner.

AgentTracked by Promptwatch

Google-CloudVertexBot

Crawling preferences addressed to the Google-CloudVertexBot user agent affect crawls requested by the site owners' for building Vertex AI Agents. It has no effect on Google Search or other products.

AI Assistant

Google-Display-Ads-Bot

Verifies site eligibility during the AdSense approval process.

Search Engine Crawler

Google-Extended

Google-Extended is a standalone product token that web publishers can use to manage whether their sites help improve Gemini Apps and Vertex AI generative APIs, including future generations of models...

AI CrawlerTracked by Promptwatch

Google-InspectionTool

Crawling preferences addressed to the Google-InspectionTool user agent affect Search testing tools such as the Rich Result Test and URL inspection in Search Console.

Monitoring

Google-Safety

The Google-Safety user agent handles abuse-specific crawling, such as malware discovery for publicly posted links on Google properties. As such it's unaffected by crawling preferences.

Monitoring

Googlebot

Crawling preferences addressed to the Googlebot user agent affect Google Search (including Discover and all Google Search features), as well as other products such as Google Images, Google Video,...

Search Engine Crawler

GoogleOther

Crawling preferences addressed to the GoogleOther user agent don't affect any specific product.

Search Engine Crawler

GoogleStackdriverMonitoringBot

GoogleStackdriverMonitoringBot is operated by Google Cloud to perform uptime checks and monitor availability of services.

MonitoringUnverifiable

GPT-Actions

Enables ChatGPT to interact with external APIs and retrieve real-time information from the web in response to user-initiated requests; allows access to up-to-date content without being used for...

AI Assistant

GPTBot

Crawls web content to improve OpenAI's generative AI models and ChatGPT; respects 'robots.txt' directives to exclude sites from training data.

AI CrawlerTracked by Promptwatch

Grok DeepSearch

Grok DeepSearch performs multi-step research across the web to answer complex Grok queries with cited sources.

AI AssistantUnverifiableTracked by Promptwatch

Grok Search

Grok Search fetches web pages in real time to power Grok's search and answer features inside X and the Grok apps.

AI AssistantUnverifiableTracked by Promptwatch

GrokBot

GrokBot is xAI's crawler used to gather web content for training the Grok family of models. xAI publishes limited documentation for it.

AI CrawlerUnverifiableTracked by Promptwatch

GTmetrix

GTmetrix provides metrics and insights for your site's loading speed and performance.

Analytics

HetrixTools Uptime Monitoring Bot

HetrixTools Uptime Monitoring Bot is used by HetrixTools's monitoring services to perform various checks on websites, including uptime and performance monitoring.

Monitoring

Hookdeck

A reliable Event Gateway for event-driven applications

Webhook

Hydrozen

Hydrozen is a tool for monitoring availability of your websites, Cronjobs, APIs, Domains, SSL etc.

Monitoring

IASBot

IAS (Integral Ad Science) crawler, formerly known as AdmantX, is used for analyzing web content to ensure brand safety and suitability for advertisers.

AdvertisingUnverifiable

iAskBot

iAskBot crawls and indexes web content to power iAsk.ai, an AI question-answering search engine.

AI AssistantUnverifiable

Iframely

Fetches your page metadata to generate rich link previews when users share your links across apps, blogs, and news sites, enhancing content visibility and engagement.

Preview

ImagesiftBot

ImageSiftBot is a web crawler that scrapes the internet for publicly available images to support Hive's suite of web intelligence products.

AI Crawler

Inngest

Inngest is a platform for building event-driven applications.

Webhook

InternetMeasurementBot

InternetMeasurementBot is operated by driftnet.io to discover and measure services that network owners and operators have publicly exposed.

MonitoringUnverifiable

Jobs with GPT

Crawls job-related pages to power jobswithgpt.com, a platform for discovering AI-enhanced career opportunities.

Search Engine Crawler

Kernel Browsers

Runs browser automation on behalf of Kernel customers for web agents, automations, and web scraping.

AI Crawler

LinerBot

LinerBot gathers web content for Liner, an AI research and answer assistant that cites the sources behind its responses.

AI AssistantUnverifiable

LinkedInBot

LinkedInBot is a bot that renders links shared on LinkedIn.

Preview

LogicMonitor SiteMonitor

LogicMonitor SiteMonitor monitors your website's uptime, performance, and availability from multiple global regions.

Monitoring

LogRocketBot

LogRocket Asset Cacher is a bot that captures and caches web assets (CSS, JavaScript, images) to ensure proper playback of user sessions in LogRocket's session replay feature.

AnalyticsUnverifiable

Lumar

The Lumar website intelligence platform is used by SEO, engineering, marketing and digital operations teams to monitor the performance of their site’s technical health, and ensure a high-performing,...

SEO

Marfeel Audits Crawler

Marfeel's audit crawlers that periodically re-crawl traffic-receiving URLs to detect structured data, meta tags, and HTML issues.

SEO

Marfeel Flowcards Crawler

Marfeel's crawler that fetches content for Flowcards that load directly from specific URLs.

Preview

Marfeel Preview Crawler

Marfeel's previewer crawler used to render preview experiences for both mobile and desktop views.

Preview

Marfeel Social Crawler

Marfeel's crawler used for social experiences (Facebook, X/Twitter, Telegram, Reddit, LinkedIn).

Preview

meta-externalads

Crawls the web to improve advertising and business-related products and services.

Advertising

meta-externalagent

The Meta-ExternalAgent crawler crawls the web for use cases such as training AI models or improving products by indexing content directly.

AI Crawler

meta-externalfetcher

The Meta-ExternalFetcher crawler performs user-initiated fetches of individual links to support specific product functions.

User Initiated

meta-webindexer

Crawls web content to provide search results for Meta AI users.

AI Crawler

MicrosoftPreview

MicrosoftPreview generates page snapshots for Microsoft products. It has desktop and mobile variants, with Chrome version dynamically updated to match the latest Microsoft Edge version.

Preview

MistralAI-Index

MistralAI-Index crawls and indexes web content for Mistral's search feature in Le Chat. Content it indexes is not used to train Mistral's generative models.

AI AssistantTracked by Promptwatch

MistralAI-User

MistralAI-User fetches web pages in real time when someone asks Le Chat a question, so Mistral's assistant can answer with current information and link to sources. It is not used for AI training.

AI AssistantTracked by Promptwatch

MJ12bot

MJ12bot is a web crawler operated by Majestic-12 Ltd, a UK-based company that builds a search engine focused on backlink analysis and web structure mapping.

Search Engine CrawlerUnverifiable

MomenticBot

Momentic is a AI-powered platform for software testing. It allows you to write reliable end-to-end tests for web apps in a simple and intuitive way using natural language.

Monitoring

naver-blueno

Naver's preview-snippet crawler that fetches summary information (titles, descriptions, images) when users insert links in Naver services such as blogs or cafés.

Preview

naverbot

Naver's web crawler (also known as Yeti) is used by Naver, South Korea's largest search engine, to crawl and index web content.

Search Engine Crawler

NewRelic Minions

New Relic Synthetic monitoring infrastructure that performs API checks and virtual browser instances to monitor websites and applications from global locations

Monitoring

OAI-AdsBot

Validates the safety of web pages submitted as ads on ChatGPT; data collected is not used to train generative AI foundation models.

AdvertisingTracked by Promptwatch

OAI-SearchBot

Indexes websites for inclusion in ChatGPT's search results; does not crawl content for AI model training.

AI AssistantTracked by Promptwatch

OhDearBot

OhDearBot is a monitoring bot operated by Oh Dear that performs uptime checks, broken link detection, and mixed content scanning.

Monitoring

Omgilibot

Omgilibot crawls public web content for Webz.io, which packages and licenses web data feeds that are commonly used to train AI models.

AI Crawler

OpenGraphXYZBot

Bot for opengraph.xyz service that generates and previews Open Graph meta tags and dynamic social media images

PreviewUnverifiable

PanguBot

PanguBot crawls web content used to train Huawei's Pangu family of large language models.

AI CrawlerUnverifiable

PayPal

PayPal delivers real-time event notifications for payments, subscriptions, and account updates.

Webhook

Perplexity-User

Handles user-initiated requests in Perplexity, accessing external content to provide real-time information; not used for automated crawling or AI training.

AI AssistantTracked by Promptwatch

PerplexityBot

Indexes websites for inclusion in Perplexity's search results; does not crawl content for AI model training.

AI AssistantTracked by Promptwatch

PetalBot

PetalBot is a web crawler operated by Huawei's Petal Search engine.

AI Assistant

PhindBot

PhindBot crawls technical and developer-focused web content to power Phind, an AI answer engine aimed at programmers.

AI AssistantUnverifiable

Pingdom Bot

Pingdom Bot is used by Pingdom's monitoring services to perform various checks on websites, including uptime and performance monitoring.

Monitoring

Pinterest Bot

Pinterest's web crawler that indexes content for their platform. It crawls websites to collect metadata for Pins, including images, titles, descriptions, and prices.

Search Engine Crawler

Polar Webhooks

Polar's webhook service delivers real-time event notifications for payment processing, including purchases, subscriptions, cancellations, and refunds.

Webhook

Promptwatch Bot

Promptwatch Bot is Promptwatch's verification crawler that validates crawler-log integrations and runs on-demand SEO and performance audits.

Verification

ProximicBot

Proximic is Comscore's web crawler that performs contextual content analysis to help advertisers determine the best matching campaigns for a page's content.

AdvertisingUnverifiable

PulsePoint Crawler

A web crawler used by PulsePoint, a digital advertising technology company, for content indexing and ads.txt verification.

Advertising

QA.tech

The QA.tech web agent browses the website and identifies potential test cases, and executes tests against a web application

Monitoring

QStash

QStash is a platform for building event-driven applications.

Webhook

Quantcastbot

Quantcast Bot is a web crawler used for advertisement quality assurance and to understand page content for Interest-Based Audiences.

Advertising

Qwantbot

Crawls and indexes web content for Qwant search engine.

Search Engine Crawler

Razorpay-Webhook

Razorpay’s webhooks enable merchants to receive secure, real-time HTTP callbacks for key payment events, automating reconciliation, notifications, and downstream workflows.

Webhook

Redirect pizza destination monitor

redirect.pizza's destination monitor ensures that the redirect destination URLs are reachable.

Monitoring

RyeBot

Powers automated checkout on behalf of shoppers with explicit consent.

AI Assistant

Sanity Webhooks

Sanity's webhook service that delivers real-time event notifications for content changes and other events.

Webhook

Sansec Security Monitor

Sansec Security Monitor is a web crawler that monitors online stores for malicious code, data breaches, and digital skimming attacks.

Monitoring

SBIntuitionsBot

SBIntuitionsBot is a crawler operated by SB Intuitions Corp. that collects web data for AI development and information analysis.

AI CrawlerUnverifiable

ScreamingFrogBot

Screaming Frog SEO Spider is a website crawler used by SEO professionals for site audits and technical SEO analysis.

SEOUnverifiable

SE Ranking Backlinks

SE Ranking's backlink analysis crawler that discovers and analyzes backlink profiles for SEO research and competitive analysis.

SEO

SeekportBot

SeekportBot is the web crawler for Seekport, a German search engine operated by SISTRIX. The bot crawls and indexes web content while respecting robots.txt directives and crawl delays.

Search Engine Crawler

SemanticScholarBot

The Semantic Scholar bot crawls domains to find academic PDFs. These PDFs are served on semanticscholar.org so researchers can discover and understand other academic accomplishments.

AI CrawlerUnverifiable

Semrush

Semrush is a platform for SEO, content marketing, competitor research, PPC and social media marketing.

SEO

Semrush Site Audit

Semrush Site Audit is a powerful website crawler that analyzes the health of a website by checking for on-page and technical SEO issues, including duplicate content, broken links, HTTPS...

SEO

Sentry Uptime Monitoring Bot

Sentry's Uptime Monitoring Bot performs health checks on configured URLs to monitor the availability and reliability of web services.

Monitoring

Seobility

Seobility is a browser-based online SEO software that helps you improve your website’s search engine rankings.

Search Engine Crawler

SeznamBot

SeznamBot is the web crawler operated by Seznam.cz, the leading Czech search engine.

Search Engine Crawler

ShapBot

Crawls and indexes web content to power Parallel's search and content extraction APIs for AI applications.

AI Crawler

Shopify Webhooks

Shopify webhooks are useful for keeping your app in sync with Shopify data, or as a trigger to perform an additional action after that event has occurred.

E-commerceUnverifiable

SISTRIX Optimizer Uptime

SISTRIX Optimizer Uptime bot performs continuous monitoring of website availability by checking the startpage once per minute. It is part of SISTRIX's SEO and website monitoring platform.

MonitoringUnverifiable

Site24x7

Site24x7 Bot is used by Site24x7's monitoring services to perform various checks on websites, including uptime and performance monitoring.

Monitoring

Sitebulb

Sitebulb is a desktop and cloud-based website crawler used by SEO professionals for technical SEO audits.

SEOUnverifiable

Slack-ImgProxy

Slack-ImgProxy is a bot operated by Slack that fetches and caches images posted in Slack channels.

PreviewUnverifiable

Slackbot

Slackbot is Slack's default, general-purpose bot that handles various API requests and integrations.

PreviewUnverifiable

SlackLinkExpandingBot

Slackbot Link Expanding is a bot operated by Slack that fetches metadata from shared links to create rich previews.

PreviewUnverifiable

SnapchatAdsBot

SnapchatAdsBot is a crawler operated by Snapchat that verifies and analyzes websites for their advertising platform.

AdvertisingUnverifiable

SnapURLPreviewBot

SnapURLPreviewBot is a crawler operated by Snap Inc. that analyzes and generates previews of URLs shared on Snapchat and other Snap platforms.

AnalyticsUnverifiable

Sogou Web Spider

The web crawler for sogou.com

Search Engine CrawlerUnverifiable

Stably

Stably is a QA testing bot that users run to E2E test their websites for functionality testing and protecting user flows against regressions.

Monitoring

StatusCake Page Speed

StatusCake Page Speed monitors your page load and render speeds.

Monitoring

StatusCake SSL Monitoring

StatusCake SSL monitors your website certificates for common issues

Monitoring

StatusCake Uptime

StatusCake monitors the uptime of your website.

Monitoring

Stripe Webhooks

Stripe's webhook service that delivers real-time event notifications for payment processing and account updates.

Webhook

Stripebot

Crawls Stripe merchant websites to collect data for service delivery and financial regulatory compliance.

Analytics

svix

svix is a webhook service for sending events to webhooks.

Webhook

TangibleeBot

TangibleeBot is a crawler operated by Tangiblee that collects product data from e-commerce websites to power their product visualization and virtual try-on services.

E-commerceUnverifiable

TermlyBot

Crawls websites to detect and categorize cookies set by first and third parties.

Monitoring

TikTokSpider

TikTokSpider is a web crawler used by TikTok/ByteDance to index and analyze web content for their platform. It helps in content discovery, link previews, and data collection for TikTok's services.

AI CrawlerUnverifiable

Timpibot

Timpibot crawls the web to build Timpi's decentralized search and data index, which is used to supply training and grounding data for AI applications.

AI CrawlerUnverifiable

Trendiction Bot

Trendiction's web crawler that discovers and collects public web data for their social media monitoring and media intelligence platform.

AnalyticsUnverifiable

TTD-Content

TTD-Content is a crawler operated by The Trade Desk that verifies content and quality of ad placements for their demand-side platform.

AdvertisingUnverifiable

Twilio Knowledge

Twilio's AI assistant crawler that gathers web content to build knowledge bases for Twilio AI Assistants, enabling conversational AI experiences with up-to-date information.

AI CrawlerUnverifiable

Twilio Proxy

Twilio's proxy service that handles communications between end-users and applications through Twilio's programmable voice and messaging platform.

WebhookUnverifiable

Twitterbot

Fetches content for shared links on X/Twitter to generate rich previews.

Preview

Updown.io

Performs uptime and performance checks on websites.

Monitoring

Uptime Robot

Uptime Robot is a platform for monitoring and alerting on your applications.

Monitoring

UsercentricsBot

UsercentricsBot is operated by Usercentrics GmbH to scan websites for data processing services and third-party technologies.

AnalyticsUnverifiable

v0bot

Bot for v0 services.

AI Crawler

Velen Public Web Crawler

Velen Public Web Crawler collects public web content for Webz.io's data feeds, which are licensed for AI training, market intelligence, and monitoring.

AI Crawler

Vemetric Favicon Bot

Fetches favicons from websites in the highest quality available.

Preview

Vercel build container

System-initiated requests made from Vercel's build container during a build

Preview

Vercel Favicon Bot

Vercel Favicon Bot

Preview

Vercel Screenshot Bot

Vercel Screenshot Bot

Preview

vercelflags

vercel flags

Monitoring

verceltracing

vercel tracing

Monitoring

Yahoo Ad Monitoring

Yahoo Ad Monitoring crawls landing pages of URLs listed with Yahoo advertising services to analyze content quality, ensure ad relevance, and improve user experience by maintaining accurate ad...

Advertising

Yahoo! Slurp

Yahoo! Slurp is the web crawler (robot) used by Yahoo! Search to discover and index web pages for its search engine.

Search Engine Crawler

YandexAdditional

YandexAdditional is the crawler Yandex uses to collect web content for its YandexGPT and other generative AI products, separate from the YandexBot search crawler.

AI Crawler

Yandexbot

YandexBot is a web crawler operated by Yandex, a major Russian search engine.

Search Engine Crawler

YisouSpider

YisouSpider is a search engine crawler operated by Yisou that indexes web content for their search engine results. The crawler follows standard crawling practices and respects robots.txt directives.

Search Engine CrawlerUnverifiable

YouBot

YouBot crawls and indexes web pages to power the You.com AI search engine and its cited answers.

AI Assistant

Be the brand AI recommends

Monitor your brand's visibility across ChatGPT, Claude, Perplexity, and Gemini. Get actionable insights and create content that gets cited by AI search engines.

Promptwatch Dashboard