1592+ crawlers profiled
Crawler Directory
Every web crawler and AI bot we track. What each one does, who operates it, and how to protect your content.
New York Times Newsgathering — What It Is and How to Handle It
New York Times Newsgathering is a content aggregator that collects and indexes data from multiple sources.
Aggregator
GuestpostsBot — What It Is and How to Handle It
GuestpostsBot is a web crawler that helps website owners who have registered their sites on the guestposts.com.br platform monitor those sites. The bot regularly visits registered sites to check that partnerships made on the platform are still active, to validate that a site exists before allowing registration, and to monitor site status so it can warn the owner of any downtime.
Monitoring
Cloudtrellis — What It Is and How to Handle It
Cloudtrellis automatically scans entire websites for broken links, accessibility issues, SEO problems, and security risks to ensure optimal user experience and search engine performance.
Monitoring
Spark Shipping — What It Is and How to Handle It
Spark Shipping is eCommerce automation software for retailers running WooCommerce.
Other
Qualys SSL Scanner — What It Is and How to Handle It
Qualys SSL Scanner is a monitoring crawler that checks website availability, performance, and health.
Monitoring
Artemis Web Reader — What It Is and How to Handle It
Artemis Web Reader is a feed reader that fetches and processes RSS, Atom, and other content feeds.
Feed Reader
MgidBot — What It Is and How to Handle It
MgidBot is MGID's web crawler that analyzes website content to detect context categories and topics for targeted native advertising recommendations and content discovery.
Advertising
MainWP — What It Is and How to Handle It
MainWP identifies requests from the MainWP control dashboard when it accesses MainWP child sites.
Other
Loomly Bot — What It Is and How to Handle It
LoomlyBot extracts metadata from web pages to show a social media post preview within Loomly, so that clients can see what their posts will look like when published.
Preview
SalesViewerBot — What It Is and How to Handle It
SalesViewerBot visits websites to enrich company information for its operator's sales and marketing service, which includes previews of visitor websites.
Advertising
Notabot — What It Is and How to Handle It
Notabot is a web crawler that integrates with Helpfeel's AI-powered FAQ search system, indexing website content to provide intelligent search and auto-complete capabilities for customer support.
Search Engine
Fastmail Bot — What It Is and How to Handle It
Fastmail Bot is a link preview crawler that fetches page metadata to generate rich previews when URLs are shared.
Preview
Black Duck Fast Dynamic — What It Is and How to Handle It
Black Duck Fast Dynamic is a security scanner for software composition analysis (SCA), identifying open source vulnerabilities and license compliance issues in web applications.
Security
Hype Machine — What It Is and How to Handle It
Hype Machine has crawled music publications and blogs since 2005 to discover new posts about artists, and it builds curated playlists from this metadata for music listeners.
Search Engine
Coinbase Webhooks — What It Is and How to Handle It
Coinbase Webhooks are automated messages sent from the Coinbase platform to a user's server, used for notifying users about events such as receiving crypto payments.
Other
Easydns — What It Is and How to Handle It
Easydns is EasyDNS's uptime monitoring probe.
Monitoring
videootv Bot — What It Is and How to Handle It
videootv Bot extracts the newest articles from a publisher's website (via feed or by parsing HTML) to build a carousel of images, links, and text for the operator's native ads module, with the aim of improving recirculation on the publisher's site. It only crawls the pages of the operator's own publishers.
Aggregator
webpagetest — What It Is and How to Handle It
WebPageTest is a popular free tool for measuring webpage performance. It lets you run web performance tests on your site from a number of locations across the world and in a number of different browsers.
Monitoring
honeybadgerbot — What It Is and How to Handle It
honeybadgerbot is a monitoring crawler that checks website availability, performance, and health.
Monitoring
RetroListeCOM — What It Is and How to Handle It
RetroListeCOM is a monitoring bot that regularly visits gaming websites, particularly Habbo Hotel retros, to collect and track real-time user count statistics for display on the RetroListe.com ranking platform.
Monitoring
WMF Zotero Translation Server — What It Is and How to Handle It
The Wikimedia Foundation's Zotero Translation Server is a customized metadata extraction tool that powers Citoid (see wikimedia-citoid). It retrieves citation data from URLs, DOIs, and ISBNs using Zotero translators, on demand in response to website visitor requests.
Other
Cloudflare Validator — What It Is and How to Handle It
Cloudflare Validator makes requests to verify IPs for the Cloudflare Bots Directory.
Other
OpenRSS — What It Is and How to Handle It
OpenRSS is a nonprofit RSS feed service bot that crawls websites to generate and provide free RSS feeds, allowing users to follow website updates in a reader app.
Feed Reader
MotoMinerBot — What It Is and How to Handle It
MotoMinerBot crawls and indexes vehicle detail pages from automotive websites to build MotoMiner's searchable vehicle database and search engine platform.
Aggregator
Innguma Fetcher — What It Is and How to Handle It
Innguma Fetcher is a feed reader that fetches and processes RSS, Atom, and other content feeds.
Feed Reader
ConvermaxBot — What It Is and How to Handle It
ConvermaxBot is a feed reader that fetches and processes RSS, Atom, and other content feeds.
Feed Reader
FeedWind Crawler — What It Is and How to Handle It
FeedWind Crawler is a feed reader that fetches and processes RSS, Atom, and other content feeds.
Feed Reader
jobswithgptcom-bot — What It Is and How to Handle It
jobswithgptcom-bot is a simple crawler that focuses only on job postings for a job search site.
Search Engine
SequelWP — What It Is and How to Handle It
SequelWP is a monitoring agent that checks website uptime at 5-minute intervals for verified customers with valid x-sequelwp headers, helping ensure web hosting service reliability.
Monitoring
Swifteq Link Checker — What It Is and How to Handle It
Swifteq Link Checker is the bot used by Help Center Export, a Zendesk-approved app that integrates with any Zendesk help center. The app exports all articles and their metadata (title, section, link, labels, updated time), exports all references to internal and external docs, and detects and exports broken links and images for each article. To check for broken links, the bot attempts to access each link present in help center articles and checks the response for errors.
Monitoring