|
|
|
|
Scraping Every Product on Amazon to Make a Million Page Affiliate Site |
|
|
|
There are no conversations. |
|
|
cauz |
May 12, 2017, 9:38 p.m. |
|
|
|
Helge Ingstad |
It was very clear that this was a very, very old site. There were remains of sod walls. Fishermen assumed it was an old Indian site. Bu Indians didn't use that kind of buildings and houses. |
Mike Davidson |
Our old site did not have very good support for the disabled, but our new site should soon have much better support. With all of our content in divs now, we can hide all but the relevant chunks of content and navigation with a simple alternate CSS file. |
Richard M. Daley |
There has been loss of steel manufacturing. Those people need jobs. Where you have to build the third airport is where people are. So you're right; if his site isn't playable, then our site is right next to it. |
Jonathan Ive |
When you're trying to solve a problem on a new product type, you become completely focused on problems that seem a number of steps removed from the main product. That problem solving can appear a little abstract, and it is easy to lose sight of the product. |
David Ben-Gurion |
There are eleven million Jews in the world. I don't say that all of them will come here, but I expect several million, and with natural increase I can quite imagine a Jewish state of ten million. |
Stephen Hawking |
If the rate of expansion one second after the Big Bang had been smaller by even one part in a hundred thousand million million, it would have recollapsed before it reached its present size. On the other hand, if it had been greater by a part in a million, the universe would have expanded too rapidly for stars and planets to form. |
James Fenton |
Writing for the page is only one form of writing for the eye. Wherever solemn inscriptions are put up in public places, there is a sense that the site and the occasion demand a form of writing which goes beyond plain informative prose. Each word is so valued that the letters forming it are seen as objects of solemn beauty. |
Adam Garcia |
I love travelling, and had the pleasure of being in the most developed country in the world and then parts of two of the most pristine natural areas of the world: the Galapagos islands and the Equador Amazon jungle. The contrast was incredible. |
Bob Iger |
Netflix, Amazon, iTunes - whatever platforms emerge - we are looking at as having the same potential that home video had for the movie business. Which means there are entirely new opportunities to monetize our capital investment in content and do so in ways that work for distributors, for consumers and for creators. |
Evan Daugherty |
I wouldn't say I see things visually first, but what I do think is important, for a lot of screenwriters, is to not just think about the words on the page, but also the world as a whole and the vibe of the movie, rather than a sequence of scenes written on the page. |
|
|
now that my list of product ids is in the millions and ive used about 40gb of proxy bandwidth scraping maybe 50k pages from that data, i have to carefully weigh out how much i want to spend on proxies (spent about $30) on this experiment that could result in just a simple takedown notice to stop the method. granted i can always reuse and modify this data. but i guarantee if you had a million page site based directly around real ecommerce products you would make good money if it stays up
|
|
|
|
Referring a user to amazon through your affiliate link gets you 24 hours to 90 days tracking cookie where you can earn commission on anything the user purchases in the time period from Amazon, the biggest online store in the world. 1 million product pages will bring long tail search traffic and careful analytics will reveal the most promising niches/products which new hyper focused niche sites can be created around.
This post is a comment.
|
|
|
|
have more than a half million product urls (which is really the hard part with amazon, they make it extremely difficult for scrapers to crawl their entire site). after cleaning up this list and potentially trying to get even more products, i will continue to modify my php scraper, this time with use for amazon. it rotates through proxies and user agents so it has worked well in google maps, yelp,. and your university's student directories, so it should bypass amazons no problem. my scraper nowadays saves all the data into xml so i can import through certain plugins, but also have a super easy way to convert to any form i need. originally my scraper rotated through tor proxies and saved all data directly into mysql, over time i created sql files for importing and now that wordpress is used so extensively and doesnt recieve penalties in the search engine like it used to, i can just throw all the data in there and make as many copies and variations of the sites as i want. and make it loo...
This post is a comment.
|
|
|
|
oih yes. also, now a days i run my scripts from a server or even my localhost through WGET and remove the output i use for testing. also another reason i use xml and import into wordpress is because they can manage a database of that sizes efficiency way better than i can. i tried to make a million page site a long time ago and it would take for ever to load my data i put in mysql directly off the scraper
This post is a comment.
|
|
|
|
one of my modules silently edits the registry for a systemwide web proxy. i'm looking for a good solution that might change amazon pub-ids, adsense ids, and most importantly redirect according to what site they are visiting. i don't care about banking, i'm already grabbing passwords. i want to be able to dynamically inject javascript into every page they visit etc
|
|
|
|
its official. amazon manually reviewed my site and determined i had no original content lol. but. it did make a few bucks (that unfortunately i wont be seeing)
|
|
|
|
Amazon Is Finally Profitable, Earns $2.5 Billion Over the Last Three Months
Amazon topped $2 billion in quarterly profit for the first time in its history, an impressive run fueled by continued growth in Prime subscriptions, cloud computing and its nascent advertising business. Amazon said Thursday that it earned $2.5 billion in profit for the three months ending in June, a staggering jump from the $197 million it posted in the same period last year. It marked the third consecutive quarter that Amazon has topped $1 billion in profit, a remarkable feat for a company once known for investing so much in its business that it often lost money. "The profitability trajectory appears to be accelerating quicker than expected," Daniel Ives, an analyst with GBH Insights, wrote in an investor note ...
|
|
|
|
Amazon Will Pay $0 in Federal Taxes on $11.2 Billion Profits (fortune.com)
Those wondering how many zeros Amazon, which is valued at nearly $800 billion, has to pay in federal taxes might be surprised to learn that its check to the IRS will read exactly $0.00. From a report:
According to a report published by the Institute on Taxation and Economic (ITEP) policy Wednesday, the e-tail/retail/tech/entertainment/everything giant won't have to pay a cent in federal taxes for the second year in a row. This tax-free break comes even though Amazon almost doubled its U.S. profits from $5.6 billion t...
|
|
|
|
this is a super old idea, even played out. but i still think i can make a better site that still recieves search traffic because im awesome at grey hat seo and making a brandable domain with a responsive site. im great at spinning content too and appearing to be a completely normal site. which it is, other than how big i scale the ideas
This post is a comment.
|
|
|
|
So I scraped 450k amazon product urls, and now i finally finished writing my scraper and finally kicked it off with some fresh proxies. downloading massive amounts of data and images from the big A hole
|
|