That is awesome that Zaphod is including your custom signatures in his next update! You should be very proud! And I want you to know that we appreciate what you do here and in sharing your knowledge with us.
Only in rare instances did he learn these from me. He's been finding user agents and adding. So, he's finding some I found-- and more. It's very rare that I find one before he does.
The forum over there is not motivated by copyright issues. But they are motivated to prevent hacking and also save people bandwidth. So lots of things get blocked merely because there is no benefit to a website owner to permit the bot.
For example: Think about 80 legs. It might be visiting your site 10 times a second to find a copyright violation. It might be visiting because one of your competitors wants to learn something about your site -- to his advantage. It might be visiting because it lets people use it for free and someone who doesn't like you might decide to set it on you just to pester you. And so on. So... why do you want to let this thing crawl?
Their blog suggests stupid things like: Maybe you are a blogger and you want advertisers to be able to serve your visitors better ads. Uhmm....My blog doesn't even have ads anyway. But suppose I did. If I had an account running banner ads with a particular ad agency, maybe
that ad agency could tell me what spider he uses and I could let
that spider visit. Why should I want to let 80 legs crawl on the hypothetical theory that some ad agency somewhere in the world could make the ads I might deliver more responsive to my visitors?!
There are all sorts of bots like that around. Plus-- the fact is-- no matter what they say they are doing, you don't really know what they are doing. But they are sucking your bandwidth.
I'm at the point where if I can't figure out what a bot is, it's banned. If they leave a link to a non-existant web page? Banned. Web page is impossible to understand? Banned. We page says 'seo', 'reputation' etc? Banned. Email to contact them? Banned until they answer the email.
I have nothing against seo-- but lots of those bots are just voracious!
But for an individual: When blocking, do think about what your business is. For lots of businesses, it might sometimes be useful to just block entire countries. For example: My hairdresser has a web site for convenience of customers. She could block everything outside the USA without harming herself at all! On the other hand, as a blogger, I don't want to block everything outside the US.