Oh thanks ! I had not looked there. It's bingbot : it is authorized, but blocked by the access limits specified in the configuration.
Offline
OHappyDay wrote:
Look at the associated IP address. If it is 54.36.148.121 then it belongs to the ahrefs.com bot (it is a marketing company).
Hello OHappyDay.
I have the following information about it:
The AhrefsBot visits your website regularly and downloads many (sometimes all) pages, images, and resources. In essence, it performs the following tasks:
What exactly does AhrefsBot do on your website?
It crawls your page content
Reads HTML content, titles, meta data, headings, text content, and internal links.
Goal: Structural analysis of your website for SEO purposes.
It scans your backlinks
Identifies which external websites are linking to your pages.
This information is used in Ahrefs tools like the Backlink Checker.
It analyzes keywords and rankings
Attempts to determine how relevant your content is to specific search terms.
These insights help Ahrefs users understand your search engine rankings.
It tracks internal linking
Detects how your pages are linked internally.
Helps evaluate the architecture of your website.
It downloads resources
CSS, JavaScript files, images, or fonts – depending on the page.
Goal: A full understanding of the page (e.g., for load time analysis).
What does this mean for you?
Advantages:
- If you use Ahrefs, you'll get valuable data about your SEO performance.
Your site will be analyzed more frequently, making its structure and visibility more transparent for you and your SEO team.
Disadvantages:
- Server load: Heavy crawling can slow down your site.
- Data exposure: Your backlinks, content, and structure become publicly accessible through Ahrefs – including to competitors.
- Bandwidth usage: Especially on shared hosting, AhrefsBot may consume unnecessary resources.
- Privacy and rights: Even content not meant for public indexing (e.g., forgotten “noindex” tags) may be scanned unless properly blocked.
Olaf
Offline
Katryne wrote:
Olaf, there is a robot that the plugin stopped 109 times, but I cannot identify it because the line does not display enough characters...
Mozilla/5.0 (Linux; Android 7.0;) AppleWebKit/537.36 (KHTML, like Gecko) Mobile Safari/537.36 (compa 109
Mozilla/5.0 (compatible; AhrefsBot/7.0; +http://ahrefs.com/robot/) 2
Edit : maybe it's not the same robot, but several which begin with the same characters ???
Hello Katryne,
Please add the following to your list and test it ->
Blocked User Agents:
AhrefsBot
compa
compa 109
Android 7.0
Last edited by Schneider-Fotografie (2025-06-08 20:02:37)
Offline
You sure ? compa is not a user agent, it's the beginning of compatible and the rest of the line cannot been displayed.
Offline
Katryne wrote:
You sure ? compa is not a user agent, it's the beginning of compatible and the rest of the line cannot been displayed.
Sorry Katryne,
you're absolutely right. Please don't enter "compa"!
I have the following in my list, and it's enough for me. As a result, I have 7656 blocked requests, from today alone!
Blocked user agents:
SemrushBot
AhrefsBot
MJ12bot
DotBot
crawler
spider
python-requests
curl
wget
YandexBot
Baiduspider
masscan
PetalBot
AspiegelBot
SemrushBot
Last edited by Schneider-Fotografie (2025-06-08 20:35:07)
Offline
I block these ones :
SemrushBot
AhrefsBot
MJ12bot
PetalBot
DotBot
Yandex
BaiDuSpider
Mail.ru
BLEXBot
SEOkicks-Robot
amazonaws.com
I am not sure how to enter amazonaws.com and Mail.ru.
Maybe I'll add the generic crawler and spider
Offline
Katryne wrote:
I block these ones :
SemrushBot
AhrefsBot
MJ12bot
PetalBot
DotBot
Yandex
BaiDuSpider
Mail.ru
BLEXBot
SEOkicks-Robot
amazonaws.com
I am not sure how to enter amazonaws.com and Mail.ru.
Maybe I'll add the generic crawler and spider
The typical Mail.ru user agent is:
Mail.RU_Bot
However, there are variants and obfuscated versions.
Recommendation: Simply enter both variants:
Mail.RU_Bot
Mail.ru
Please let me know if it works!
Offline
I have to report a problem with bot protection:
When I try to connect/login to my website through the piwigo app on my Android mobile device I get blocked.
I have no idea how to resolve this. Deactivating the plugin is not an option now that it works so good.
In the log I see this:
Missing referrer and cookies UA: Dart/3.4 (dart:io)
Ideas?
Klaus
Last edited by OHappyDay (2025-06-08 21:39:42)
Offline
OHappyDay wrote:
I have to report a problem with bot protection:
When I try to connect/login to my website through the piwigo app on my Android mobile device I get blocked.
I have no idea how to resolve this. Deactivating the plugin is not an option now that it works so good.
Ideas?
Klaus
Enter your IP address in the Allowed IP Addresses section, and it should work:
Enter one IP address per line. These will be excluded from bot restrictions.
Offline
I am aware of that possibility but such IP addresses will change over time and they also depend on which network I am connected with (home wireless, other wireless connections when not at home etc.).
Offline
I found a solution: I added the identification for the app ("Dart") to the file whitelisted_bots.txt. Now the app can connect to the website.
Offline
OHappyDay wrote:
I found a solution: I added the identification for the app ("Dart") to the file whitelisted_bots.txt. Now the app can connect to the website.
Exactly, you can then add Dart and Piwigo to the whitelist later!
Olaf
Offline
Schneider-Fotografie wrote:
Blocked user agents:
...
curl
wget
...
Fwiw, curl and wget are not bots, they are command line tools that may be used in several contexts.
Offline