Click Official ELI Links
Get Help With Your Extortion Letter | ELI Phone Support | ELI Legal Representation Program
Show your support of the ELI website & ELI Forums through a PayPal Contribution. Thank you for supporting the ongoing fight and reporting of Extortion Settlement Demand Letters.

Show Posts

This section allows you to view all posts made by this member. Note that you can only see posts made in areas you currently have access to.


Messages - lucia

Pages: 1 ... 22 23 [24] 25 26 ... 44
346
Getty Images Letter Forum / Re: Image scraping: The Newsblur angle.
« on: August 18, 2012, 09:26:41 AM »
Jerry,
First: On this thread, my main intention is to alert readers that they should be aware that it is now easier for image scrapers to find images.  This is separate from the copyright issue and merely has to do with the way this group is displaying stuff.

I'm not sure how everyone is going to detect this-- I know how I am. Among other things, in .htaccess, if the referrer is newsblur,  any uploaded blog images will be replaced by pictures of a cat.  You can see this if you load http://www.newsblur.com/site/1100897/ which currently seems to default to "feed". If not, click feed. Scroll down. You'll see cats. (I can also detect that I have an imperfection in my redirection because that first graph which is a '.png' ought to also be a cat, but it's a graph.)

Second: On the copyright issue: I am going to send a more formal cease and desist. But here's the background:

On newsblur's site, FAQ says to 'opt-out' we should email Clay Samuels the owner, founder, coder. I emailed and told him to take me off. He sent me a sales pitch telling me how great his service was and actually told me he was my best friend because he was copying my material in part.   I repeated my request he stop. He did not respond to this email in any way. Silence.  This pissed me off.

On Monday, I blogged, and I made changes to my page so that people viewing the fresh copies read my opinion about the practice and are autoforwarded to my real site. (This is called the "ass-hat message" which is written in javascript. To see it visit http://newsblur.com/reader/page/1100897.  That's what people view if they click to the "story view" in the previous url which I sent you. which is the one I have asked him to take down and also the one that involves copying. You can see by scrolling down here:
http://www.newsblur.com/site/1100897/


On Tuesday I tweeted about the page display and the owner of newsblur made some quick changes which supposedly intended to eliminate the problem. He tweeted back that he had made changes to prevent loading. I tweeted back that copies were still displaying.  Fresh copies continue to be made.  (Although initially I thought they had stopped-- but I was mistaken. What had happened was merely that Newsblur has copied a display showing that their IP had been banned at Cloudflare. It looked pretty funny actually.)

Later, I noticed the newsblur continued to show fresh copies. I did some tweaking to verify that
a) their bot does not  visit robots.txt (which would have told them their visits are disallowed)
and
b) their bot does not  visit the "noarchive" metatag.)

This means that as far as I can tell, there is no way for my server to communicate my wish to "don't visit here" and "don't copy" to their bot.  And the bot just copies.

I have also been taking fresh snapshots of his copies and my  pages because at this point I anticipate that he might continue to fail to stop copying.  In preparation for that I want to have a packet of "stuff".  But my plan forward is:
1) Send him another email cease and desist.
2) If he does not cease and desist, send a DMCA notice to either his hosting company, his name server or both.

But before I do (2) I want to be certain that I have evidence in place that should he dispute my take down, I would have proper evidentiary materials so that I win and my court costs are covered.  That's why I am asking people what I should log etc.

I also wouldn't mind if people might suggest whether they can think of any reason in the world why I might fail to win a case in copyright court. Because I certainly don't want to go to court and lose.

347
Getty Images Letter Forum / Re: Image scraping: The Newsblur angle.
« on: August 17, 2012, 05:24:12 PM »
Isn't innovation great?  Love the fact that I can use the multitudes of RSS readers out there and track my favourite blogs; unfortunately this makes the job so much easier for trolls - they don't need to spend the money (or at least not a lot of money) on building their own infrastructure - they just use someone elses.
I can't think of a single way normal RSS readers would make it easier for trolls to find things.  Newsblurs novel new method of presenting material to the public would make it easier.

348
Mulligan--
I"m just upset generally.

Unfortunately, the copyright issue can be especially frustrating if you have a blog for creative expression which is not run for profit and also want to control your content.  From the point of view of a content creator there are all sorts of things that are bad about the way newsblur presents the content.  Only one of the bad things involve copying. 

349
Getty Images Letter Forum / Re: Image scraping: The Newsblur angle.
« on: August 17, 2012, 01:59:39 PM »
Moe--
What I am describing is party A making it much, much easier for party B to scrape party C. (I would be C.)  It happens that one of the things party A is doing is copying my site without permission and posting the copy in a public place.  And I don't like that copying for many reasons. But what I am trying to explain is that one of the reasons I don't like it is that it facilitates party B.  So I would like party A to cease and desist copying-- just as I wrote on the other thread.  Party A is not scraping.

As for any parties who scrape: I am already doing what I can to impede that. I do more or less the same things I was previously doing. But because of party A is making copies of my blog available in a certain specific way, a method that could make B's scraping attempts scraping easier has been created.  B will find it easier to scrape my site. If you have a blog and  A has copied your blog, B will find it easier to scrape your site. Same with Budhappi's blog and so on.  Possibly none of you will notice the scrape.

I might intercept it. Or not.

350
Getty Images Letter Forum / Re: Image scraping: The Newsblur angle.
« on: August 17, 2012, 01:52:38 PM »
Moe--
What I am describing is party A making it much, much easier for party B to scrape party C. (I would be C.)  It happens that one of the things party A is doing is copying my site without permission and posting the copy in a public place.  And I don't like that copying for many reasons. But what I am trying to explain is that one of the reasons I don't like it is that it facilitates party B.

As for any parties who scrape: I am already doing what I can to impede that.

351
Legal Controversies Forum / Re: INteresting Friday nugget
« on: August 17, 2012, 10:07:10 AM »
Shoot. Can I put that on my blog?  :)

352
Greg--
By the way, the existence and display of copies of blogs at that site happens to represent a gold mine for copyright trolls with very modest programming skillzzzzz. On the getty thread, I posted what a copyright troll should do... today... at that site.

353
Getty Images Letter Forum / Image scraping: The Newsblur angle.
« on: August 17, 2012, 09:10:47 AM »
I want to stop a moment and explain how a trove of blog "snapshots" (aka full copies of html) displayed by a third party (like Newsblur) can be used by a a picscout / getty / image scraper / copyright troll to efficiently find and inspect image. There are two ways I thought of relatively quickly and there may be more. Both are methods would be much easier to code and implement and would save a bot a lot of time relative to what exists absent a trove of conveniently supplied "snapshots". 

If I were an image bot owned by a copyright troll, I would do this right now:
Method 1) Write a crawler to progressive visit copies of site with addresses like http://newsblur.com/reader/page/1 (Give it a try. ) 

I would compare each image at that site to my trove of images.  That would essentially show my bot the header and "decoration" images that display in any blog plus any images that happen to be displayed on the top page on the day my bot visited.

After visiting 1, now load http://newsblur.com/reader/page/1 . (My site number is up above 1,000,000. So you'll want a bot. But it really is a trove.) 

So, the existence of all these copies displayed publicly is a very handy thing for copyright trolls.  The copy of my site is unauthorized. And this sort of thing is one of the reasons I am really  pissed off.

Method 2) If copyright troll, I would also do the following:
Program a bot to visit http://www.newsblur.com/site/1.    Code whatever is required to make the site believe you have clicked "feed".  Cause the "scroll bar" to scroll down... keep scrolling... keep scrolling.  (Both could be coded but requires programming. I'm a poor programmer, it would likely take me a day.  A good programmer could do it more quickly. I'm sure the guys at Picscout can do it as soon as they read this post.  That is assuming they haven't already learned of the existence of Newsblur in which case, figuring out that they could do it and how to do it would take them... on... 2 hours? )

Once this is done, the bot can them load every image in every blog post the blogger every posted.  It "sees" every single address for every image and loads them the way a browser would.

None of this should make us happy.

That fact that I could think of these methods very quickly (and did) contributes to why I am very, very upset by the possibility of losing control of how my site displays.  To the extent that the copying is involved, this is a copyright issue. But to the extent that it facilitates images scraping, it is a "getty copyright troll" issue.

354
BTW: I don't want to suggest that I can dictate what Oscar should be interested in.  He is interested in what he is interested in for his own reasons. But I will admit as a small hobby blogger to being very upset that someone with funding is copying my stuff in an rather non-industry standard way that I oppose. And even though I believe he should not copy my stuff in the way he is doing and I very, very strongly suspect it violates copyright, pursuing it would be inordinately expensive.

I should add that it doesn't help my mental state to know this guy got an idea into his head, coded, and the "brilliant" idea seems to have been welcomed by the coding community. He's gotten subscribers and now has a small business. He recently quite his job to develop the business.  I do not welcome the notion that to protect my interest I would need to pursue a case that represent a serious reversal to his business.

But really--  I don't see how it is the case that if other people spend time thinking about use the web in a way that efficiently monetizes their time that means that I am forced to lose all control of whether my stuff is copied and displayed.

Ok.. sorry. I'm upset.  ....

355
Lucia, wouldn't it be easier (and perhaps smarter) and certainly less time-consuming to simply monetize in some way each blog post you make instead of getting into what could become a pissing contest with a fellow programmer, where you both end up spending all kinds of time trying to outwit the other? 
1)  I don't see this as a pissing contest with a fellow programmer. For one thing I am not a programmer. By training I am a mechanical engineer who mostly blogs about climate change as a hobby. (I previously blogged about knitting and I would be just as p*ssed off about what he's doing if this were my kntting blog.)

2) Just because blogging is a hobby for me, I don't see any reason why that means I lose copyright protection so that someone <i>else</i> can repackage and monetize my content without sharing any of the money with me.

3) tell me how to easily monetize blog posts on climate change without alienating the audience I cherish so the so the money ends up in my pocket.

4) Suppose I could monetize: If so, this copying would possibly reduce my earnings.

5) Suppose you can't think of any good way for me to monetize while retaining the audience I cherish. Does that give someone else the right to "sell" his wrapper when it is only valuable because it is wrapped around my (an other hobby bloggers ) contents? 



In just about every blog post I put up, I have a link to something that helps me pay the bills around here as well as a link to an article or another blog post on one of my other sites. For me, much of my time is spent writing to generate traffic and name recognition.
That's what you do. So? What does that have to do with me?  My site is mostly a hobby blog.  My time is spent writing about thing that I am curious about. And it happens I attract quite a bit of traffic. (That is: when I write about climate change as opposed to other stuff. The posts with the equations generate tons of traffic. Can't use them to sell much of anything-- but so?)

As for name recognition, I don't want to get into a pissing contest. Maybe you have better name recognition than I do. Or not. Maybe you value name recognition. I don't see that as remotely relevant here. 

You're a lot smarter than I am, so I only throw this out as a suggestion to maybe help you save some time and energy trying to defeat a trend of taking other peoples' work without permission, a trend that I don't see disappearing... and in a way, that's good, because that's kind of the idea the Internet was founded on anyway... free information for everyone.
First: Was the internet founded to create free information for everyone? Says who?  I've got tons of "sockulators" and other free stuff out there. I'm happy to pay hosting costs to let knitters use those for free.  I don't see how then fact that lots of people post lost of things for free obligates me to permit others to make money by bundling up my stuff without sharing some of the money with me.

I happen to think that this notion that everything is free on the internet is a double edged sword.  The fact that it is difficult for people who put out good valuable information have their stuff taken without any thought to remuneration is not "good".  I don't know if my stuff is any "better" than yours-- but I get quite a bit of traffic.  No matter what I've blogged about, I've gotten traffic. (Knitting got me the most-- but I became more interested in climate change.)

But it's traffic that is difficult to monetize because it has nothing to do with saleable products. That's ok with me. BUT that doesn't mean I want someone else to create a saleable product by finding all the interesting people who don't make money, putting a bow on it and selling it.


I think one of the key principles for earning a living on the internet is to never spend a minute worrying about copyright and stealing but instead to figure out how to embed my work with clever stuff so that innocent and well as intentional infringers and downright thieves end up helping me expand my brand, so to speak.
Oh? Is that some sort of key principle?

Anyway, I don't blog to "earn a living on the internet". I may blog for different reasons than you do. That's permitted.

Anyone can visit my site for free-- and if you visit the front page you'll see ZERO ads. You'll see ZERO links to marketable things.  It's my right to do this. The fact that I chose to not have ads or sell much of anything on my blog doesn't give this guy a right to bundle my content up in his "product" and then sell his product to people who pay "premium" packages of "feed reading".  The fact is: What he is selling is NOT a feed.  It's not going to bring me any eyeballs.  It's not going to bring me anything I value.   

I have no objection to his product provided his product isn't basically a "subscription" to read his copy of my stuff. If he wants to sell other people's stuff- not my business.  But I should think if his sois dissant 'feed reader' is all that great, he doesn't need to send a bot to copy my front page (not feed!) on a daily basis to sell the reader. 

Look: I recognize fair use. I use it. I recognize the value of created by people giving things away for free. I recognize that some people blog about subjects that can be monetized to make money. But none of these things mean that if I chose to blog about a subject that attracts lots of eyeballs but is somewhat difficult to monetize, then someone else should just be able to grant themselves permission to copy because they think they know how to monetize (provided they pay me nothing.)

I'm watching to see if this guy maybe figured out he failed to stop copying. If he failed, I'm going to check a few details that mattered in a google case.  Then, if he hasn't stopped, I'm going to send a DMCA. As for suing or asking for money: I would only do that as a last resort. But I might do it.

But really, this guys' business model is sufficiently tenuous that I think Oscar might be interested in aspects to discuss as hypotheticals in his copyright course! (Heck. I don't even thing some things are tenuous. The main questions: What can a copyright right owner do that is not basically putting themselves at risk of spending lots of their own money to pursue a "principle".)

356
Copying has resumed.   Sigh...

357
The copyright office will allow registration of photo compilations to provide individual protection to the images contained in the compilation but only if each photograph is identified separately and have a common author.
By "have a common author" do you mean "Joe Schmoe, photographer" could register a compilation of 10 photos all taken by "Joe Schmoe" provided each photo is identified separately?

358
He's stopped now. :)

359
The site also appears to function as follows:
1) at the lower part of the outerframe, you can see "orginal", "feed" and "story" views.  The "original" which as far as I can tell always displays by default frames what appears to be a copy of my content. That copy is hosted at "it's not my server" site. 

2) The feed... I don't know. I think it's probably a copy. But that's ok.  It's pretty conventional for feed readers to store feeds. (That said, many require users to log in to view. They don't just hang out another copy of the feed. )  Anyway, it might be a copy or a frame. If it's a copy, I think the Oscar could debate whether copying is a violation in his copyright class. But I wouldn't care. So, wouldn't matter.

3) The story view: These appear to be material hosted on my server in an iframe. Because it is, I busted that by merely sending <?php
header(‘X-Frame-Options: DENY’);
?>.    This is easy to "bust" because they are framing material that is hosted on my server-- and as you observe, that's not a copyright violation.

The problem is #1.  I don't like that.

Mind you: I would like them to pay me money to do #3. But framing stuff on my server causing a browser to visit my server and fetch it is not copying.  So... that's not what's got me hot and bothered.

360
Quote
Is the content being "copied" in an IFrame?

It appears a bot visits the top page of my blog http://rankexploits.com/musings copies the html, hosts it on a server-that-is-not mine. They then frame this copy and display it to people visiting their site.

I injected javascript which displays when either top.location or self.location do not match "rankexploits.com"

http://www.newsblur.com/site/1100897/

The framed page appears to be this address: Note: not mine.
http://newsblur.com/reader/page/1100897

I'm going to post so you have a chance to look. ( I tweeted, and if the person who is copying is wise, he'll either a) take down the copy or b) explain that it's not a copy.

Pages: 1 ... 22 23 [24] 25 26 ... 44
Official ELI Help Options
Get Help With Your Extortion Letter | ELI Phone Support Call | ELI Defense Letter Program
Show your support of the ELI website & ELI Forums through a PayPal Contribution. Thank you for supporting the ongoing fight and reporting of Extortion Settlement Demand Letters.