GameIndustry.eu Logo

2020 2019 2016 2019 2018 2021 2016 
GameIndustry.eu /  Bot & Crawlerlist

  Bot & Crawler List – Download IPv4 Addresses and User Agents


Websites have long been scanned by automated systems — for search engine indexing, content extraction (scraping), vulnerability detection, or increasingly, the collection of training data for AI models.

In many cases, this occurs without the operators’ consent and in violation of technical restrictions such as the robots.txt file or applicable legal regulations. Service companies and larger corporations naturally make use of data and content when and how it suits them, usually without transparency or responsibility.

Such activity not only generates unnecessary server load but can also pose security risks, distort analytics and statistical data, and negatively impact a site's visibility in search engines.

Take advantage of:

  1. Structured IP and user agent lists available for download
  2. Information on origin, company, identification and activity
  3. Clear tables for fast filtering and blocking

 

  Detailed Bot Catalog: Company Logos, Identifications, User Agent Details and IP Information

 Artificial Intelligence

The "Artificial Intelligence" category lists bots and crawlers that automatically collect website content. These systems are used to train AI models and frequently operate outside legal or ethical boundaries.

LogoCompanyWebcrawler / BotUser-Agent(s)LinesIPv4Lines
AmazonbotAmazonbot35
ApplebotApplebot319
Bytespider, 
TikTokSpiderBytespider
TikTokSpider
45121
CCBotCCBot36
VelenPublicWebCrawlerVelenPublicWebCrawler38
YaKYaK35
AdIdxBot, 
BingBot, 
BingPreviewAdIdxBot
BingBot
BingPreview
6515
DotbotDotbot46
NeevabotNeevabot34
ChatGPT-UserChatGPT-User3449

 Bots

Search engine bots automatically scan the web to index website content. They are essential for web visibility but can strain server resources under high load.

LogoCompanyWebcrawler / BotUser-Agent(s)LinesIPv4Lines
RainBot35
AhrefsBotAhrefsBot3304
PetalBotPetalBot5227
BarkrowlerBarkrowler310
Cốc CốcCốc Cốc4112
Initdex-BotInitdex-Bot36
DataForSeoBotDataForSeoBot36
DuckDuckGo-Favicons-BotDuckDuckGo-Favicons-Bot314
CheckMarkNetworkCheckMarkNetwork33
Googlebot, 
Google Favicon, 
Googlebot-ImageGooglebot
Google Favicon
Googlebot-Image
9170
InfoTigerBotInfoTigerBot33
MJ12botMJ12bot359
MojeekBotMojeekBot33
QwantifyQwantify554
fluid33
SemrushBot, 
SemrushBot-BASemrushBot
SemrushBot-BA
551
SeznamBotSeznamBot554
SeekportBot, 
Seekport CrawlerSeekportBot
Seekport Crawler
6521
SurdotlyBotSurdotlyBot33
AwarioSmartBotAwarioSmartBot48
BLEXBotBLEXBot34
YandexBot, 
YandexFavicons, 
YandexImagesYandexBot
YandexFavicons
YandexImages
6579
ZoominfoBotZoominfoBot330

 Crawlers

Crawlers automatically scan websites to collect data. While useful, they may heavily burden server resources.

LogoCompanyWebcrawler / BotUser-Agent(s)LinesIPv4Lines
AdsTxtCrawlerTP34
MegaIndex.ru/2.0MegaIndex.ru/2.033
ev-crawler, 
e.ventures Investment Crawlerev-crawler
e.ventures Investment Crawler
48
EBID AG CrawlerEBID AG Crawler33
LinespiderLinespider35
Netestate Ne CrawlerNetestate Ne Crawler34
BirdcrawlerbotBirdcrawlerbot33

 Miscellaneous

Miscellaneous refers to services that have not yet been assigned to a specific category.

LogoCompanyWebcrawler / BotUser-Agent(s)LinesIPv4Lines
Facebook External User AgentFacebook External User Agent310
Github CamoGithub Camo457

 Readers

Readers extract and aggregate content automatically. Some may ignore copyrights or usage restrictions.

LogoCompanyWebcrawler / BotUser-Agent(s)LinesIPv4Lines
FeedlyFeedly35