261 commits

Author SHA1 Message Date
doomedraven
9774622914
Add CensysInspect
167.248.133.40 - - [03/Jan/2021:09:19:31 +0000] "GET / HTTP/1.1" 301 169 "-" "Mozilla/5.0 (compatible; CensysInspect/1.1; +https://about.censys.io/)" "-"
2021-01-03 10:26:42 +01:00
Łukasz Rżanek
629429aacc
Update bad-user-agents.list
Adding:
* Cocolyze bot: `Mozilla/5.0 (compatible; Cocolyzebot/1.0; https://cocolyze.com/bot)`
  Actually, whatever it is. I cannot access the given link from any location (tried four different networks!). I always get the Cloudflare lock page. So, sorry, bad crawler as it doesn't even want me to identify it.
* ZoomBot/Linkbot - different user agents!, e.g.: `ZoomBot (Linkbot 1.0 http://suite.seozoom.it/bot.html)`
  No information if they obey robots.txt, because when I go to the given URL I am informed the website is only available for paying customers. So, sorry, no...
* Velen - `Mozilla/5.0 (compatible; VelenPublicWebCrawler/1.0; +https://velen.io)`
  Link takes me to a Google 404 page. Strange. Crawling my website like crazy, not obeying (as far as I can tell) robots.txt.
2021-01-02 15:46:38 +01:00
Travis
18e469d230 V4.2021.01.2230 [ci skip] 2021-01-02 15:47:42 +02:00
Mitchell Krog
61bf0219af
Update bad-user-agents.list
REF: https://github.com/mitchellkrogza/nginx-ultimate-bad-bot-blocker/pull/416
REF: https://github.com/mitchellkrogza/nginx-ultimate-bad-bot-blocker/pull/417/files
2021-01-02 15:35:54 +02:00
Travis
1e039f3e16 V4.2020.12.2193 [ci skip] 2020-12-14 09:48:49 +02:00
Mitchell Krog
3a39d469c0
Update bad-user-agents.list
Another pesky crawler - Pandalytics/1.0
2020-12-14 09:35:57 +02:00
Travis
93a8fb97b3 V4.2020.12.2187 [ci skip] 2020-12-04 11:13:21 +02:00
Mitchell Krog
b2bdbe6c44
Update bad-user-agents.list 2020-12-04 10:18:28 +02:00
Mitchell Krog
72c4808c12
Update bad-user-agents.list 2020-11-24 08:49:02 +02:00
Travis
b5bd695219 V4.2020.07.2108 [ci skip] 2020-07-28 09:16:08 +02:00
Mitchell Krog
9a09d72cbf
Update bad-user-agents.list 2020-07-28 09:04:31 +02:00
Travis
3802d3dc9a V4.2020.06.2081 [ci skip] 2020-06-15 11:25:39 +02:00
Mitchell Krog
5a9c2cfcb5
+ Bad Bot - Mozlila 2020-06-15 11:08:03 +02:00
Travis
5594e36132 V4.2020.04.2055 [ci skip] 2020-04-26 10:33:38 +02:00
Mitchell Krog
041661dea9
+BOT - Closes: #378
domainsproject.org
2020-04-26 10:21:43 +02:00
Mitchell Krog
ce7c133e1d
Fix Spelling of Aspiegel 2020-04-15 12:10:53 +02:00
Mitchell Krog
52174ef11b
Moving zh_CN zh-CN to limited bots
REF: #366
2020-03-14 09:23:54 +02:00
Travis
41078a2384 V4.2020.03.2017 [ci skip] 2020-03-12 15:27:40 +02:00
Mitchell Krog
2972c957d3
+ Aggressive Chinese Scrapers Closes: #364 2020-03-12 15:15:30 +02:00
Ronie Martinez
de86fc75c8 Add "Ankit" and "polaris version" to bad-user-agents.list 2020-02-26 23:32:29 +08:00
Travis
50732ab1ac V4.2019.10.1867 [ci skip] 2019-10-16 09:53:04 +02:00
Mitchell Krog
86214d08f8
Update Bad Referrers | Update Bad User Agents 2019-10-16 09:42:46 +02:00
Mitchell Krog
a576e3457d - [UA] MS Web Services Client Protocol
REF: https://github.com/mitchellkrogza/apache-ultimate-bad-bot-blocker/pull/132
2019-08-16 14:24:00 +02:00
Mitchell Krog
1288e4870a
+ RSSingBot
2019-08-03 17:11:27 +02:00
Travis
9fa8feb7c2 V4.2019.06.1584 [ci skip] 2019-06-25 08:16:05 +02:00
Mitchell Krog
e5e2946dd1
+ Bad Bot / Thumbor/6.4.2
thumbor is a smart imaging service. It enables on-demand crop, resizing and flipping of images.

It also features a VERY smart detection of important points in the image for better cropping and resizing, using state-of-the-art face and feature detection algorithms (more on that in Detection Algorithms).

https://github.com/thumbor/thumbor
2019-06-25 08:10:26 +02:00
Travis
314995230f V4.2019.06.1583 [ci skip] 2019-06-24 20:42:42 +02:00
Mitchell Krog
c96e78086f
+Bad Bot OutclicksBot 2019-06-24 20:36:53 +02:00
Mitchell Krog
4e325e0f09
Remove Slackbot-LinkExpanding False Positive 2019-06-23 16:42:11 +02:00
Mitchell Krog
3ce74818eb
Remove Bubing from bad-bots: duplicate, as it's already rate limited as BUbiNG 2019-06-23 11:23:26 +02:00
Travis
0a4b3b0efd V3.2019.06.1437 [ci skip] 2019-06-20 09:35:09 +02:00
Mitchell Krog
b64214033b
Update bad-user-agents 2019-06-20 09:31:47 +02:00
Travis
cc1b102dad V3.2019.06.1436 [ci skip] 2019-06-20 09:11:00 +02:00
Mitchell Krog
5973deddb6
Update bad-user-agents 2019-06-20 09:07:42 +02:00
Travis
f580816ed0 V3.2019.06.1435 [ci skip] 2019-06-19 18:25:29 +02:00
Mitchell Krog
e6eea71f25
Update bad-user-agents 2019-06-19 18:19:45 +02:00
Travis
0925bff34c V3.2019.06.1434 [ci skip] 2019-06-19 10:22:22 +02:00
Mitchell Krog
28d3fc67e8
Update bad-user-agents 2019-06-19 10:19:12 +02:00
Travis
675260442d V3.2019.06.1433 [ci skip] 2019-06-19 10:00:25 +02:00
Mitchell Krog
4a364cc22f
Update bad-user-agents 2019-06-19 09:56:56 +02:00
Travis
e4884c8a26 V3.2019.06.1432 [ci skip] 2019-06-19 09:53:37 +02:00
Mitchell Krog
8eba38de21
Update bad-referrers / Update bad-user-agents 2019-06-19 09:49:54 +02:00
Mitchell Krog
c895de7ddd
Remove x22 cases - Require Exact Matching 2019-04-02 11:05:27 +02:00
Travis
32c69933b0 V3.2019.04.1358 [ci skip] 2019-04-01 13:31:58 +02:00
Mitchell Krog
a6e0e00f05
Update bad-user-agents.list + bad-referrers.list 2019-04-01 13:29:33 +02:00
Travis
e5db42a129 V3.2019.03.1355 [ci skip] 2019-03-28 11:53:17 +02:00
Mitchell Krog
74f4d6f9cb
Update bad-user-agents.list 2019-03-28 11:21:28 +02:00
Mitchell Krog
48e99f5b29 Add Mediatoolkitbot
Closes: #245
2019-01-25 15:10:51 +02:00
Ondrej Simek
6b9c46f47f Remove Seznam.cz from bad user agents. 2019-01-11 11:05:01 +01:00
Travis
a2aeeb4932 V3.2018.08.1176 [ci skip] 2018-08-24 16:57:07 +02:00