全球主机交流论坛 (Global Host Exchange Forum)

Views: 619 | Replies: 5

Cloudflare shows Googlebot crawling 10K times in one hour. Is that normal?

Posted 2020-12-6 18:31:46
Last edited by 2019年 on 2020-12-6 19:04

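As the title says. I run an English-language site.

While I'm here, I'll share a good rule I found online for blocking bad bots. It suits English-language sites, because it also blocks some Chinese bots: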
(http.user_agent contains "Yandex")
or (http.user_agent contains "muckrack")
or (http.user_agent contains "Qwantify")
or (http.user_agent contains "Sogou")
or (http.user_agent contains "BUbiNG")
or (http.user_agent contains "knowledge")
or (http.user_agent contains "CFNetwork")
or (http.user_agent contains "Scrapy")
or (http.user_agent contains "SemrushBot")
or (http.user_agent contains "AhrefsBot")
or (http.user_agent contains "Baiduspider")
or (http.user_agent contains "python-requests")
or (http.user_agent contains "crawl" and not cf.client.bot)
or (http.user_agent contains "Crawl" and not cf.client.bot)
or (http.user_agent contains "bot" and not http.user_agent contains "bingbot" and not http.user_agent contains "Google" and not http.user_agent contains "Twitter" and not cf.client.bot)
or (http.user_agent contains "Bot" and not http.user_agent contains "Google" and not cf.client.bot)
or (http.user_agent contains "Spider" and not cf.client.bot)
or (http.user_agent contains "spider" and not cf.client.bot)
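To see which requests the rule would match, it can help to model it offline. The following is only a rough Python sketch, not Cloudflare's implementation: `is_blocked` and its `verified_bot` parameter are hypothetical names, `verified_bot` stands in for `cf.client.bot`, `contains` is treated as a case-sensitive substring test, and the exclusion for Twitter's crawler is modeled as the substring "Twitter" (an assumption about the intended token).

```python
# Substrings that the posted rule matches unconditionally.
ALWAYS_BLOCKED = (
    "Yandex", "muckrack", "Qwantify", "Sogou", "BUbiNG", "knowledge",
    "CFNetwork", "Scrapy", "SemrushBot", "AhrefsBot", "Baiduspider",
    "python-requests",
)

# Substrings that the rule matches only when cf.client.bot is false.
BLOCKED_UNLESS_VERIFIED = ("crawl", "Crawl", "Spider", "spider")


def is_blocked(user_agent: str, verified_bot: bool) -> bool:
    """Return True if this model of the rule matches the request.

    verified_bot is a stand-in for Cloudflare's cf.client.bot field.
    """
    if any(token in user_agent for token in ALWAYS_BLOCKED):
        return True
    if verified_bot:
        # Every remaining clause carries "and not cf.client.bot".
        return False
    if any(token in user_agent for token in BLOCKED_UNLESS_VERIFIED):
        return True
    # Lowercase "bot", with the three exemptions written in the rule.
    exempt = ("bingbot", "Google", "Twitter")
    if "bot" in user_agent and not any(t in user_agent for t in exempt):
        return True
    # Capitalized "Bot", exempting only "Google".
    if "Bot" in user_agent and "Google" not in user_agent:
        return True
    return False


googlebot = "Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)"
print(is_blocked(googlebot, verified_bot=True))          # False
print(is_blocked(googlebot, verified_bot=False))         # False
print(is_blocked("MJ12bot/v1.4.8", verified_bot=False))  # True
```

In this model, a user agent containing "Google" is never matched by either "bot" clause, whether or not `verified_bot` is true, while names such as "Yandex" are matched even for verified bots. Running real user agents from your own logs through a model like this is one way to check the rule before deploying it.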

The screenshot below is my Cloudflare dashboard. Traffic from good bots (light blue, "allow") far exceeds traffic from bad bots.
Posted 2020-12-6 18:46:48 from mobile
Notice: the author has been banned or deleted, so the content is hidden automatically.
Original poster | Posted 2020-12-6 18:57:09
has wrote on 2020-12-6 18:46:
Will the rule above block Google?

No, it won't. I'm using it myself and it works well.
The part with contains "Google" and not cf.client.bot is what blocks fake Googlebots.
Posted 2020-12-6 19:01:21 from mobile
Why block Yandex and Sogou?
Posted 2020-12-6 19:27:07
What kind of site do you run? No bots ever come to crawl mine.
Posted 2020-12-6 19:37:24
What if a spider isn't on Cloudflare's list of known bots?