|
|
|
|
This list of 400k product ids include lots of copies of the same product with a different tracking number. im only getting maybe 30k off that list total. was gonna scrape more after so my next run of my id gathering, ill find better ways to remove redundancy and save some money. ive used almost 30g of bandwidth through those proxies the past few days. but i also download huge high rez images too |
|
|
|
There are no conversations. |
|
|
cauz |
May 17, 2017, 9:10 p.m. |
|
|
|
Leos Carax |
My films start with images, a few images and a few feelings, and I try to edit them together to see the correspondence between these images and these feelings. |
B.o.B |
A lot of artists go in the studio and say, 'OK, whaddaya want me to do? Is it gonna be a hit? I'll do it. Is it gonna get played on the radio? I'll do it.' So they start makin' these songs, and they fall in the same tempo, same category, same this, same that, and it'll just all sound the same. |
Dan Farmer |
Even if the music industry simply gave away all their music people would complain that they don't have the bandwidth to download all the stuff - the problem would merely shift from availability to distribution. |
Jack Canfield |
Write your goals down in detail and read your list of goals every day. Some goals may entail a list of shorter goals. Losing a lot of weight, for example, should include mini-goals, such as 10-pound milestones. This will keep your subconscious mind focused on what you want step by step. |
Stevie Jackson |
I got involved in lots of different areas round about 2007, 2008. Just working with lots of different people and stretching myself in different ways. I was working on art projects and working with other writers, just doing bits and pieces, trying to keep busy. |
Bruce Jackson |
First, those images help us understand the general and specific magnitude of disaster caused by the tsunami. The huge outpouring of aid would not have happened without those images. |
Dave Eggers |
When I was on the bestseller list with the first book, everyone who knows me knows that every week it continued to be on the list was a very dark week for me. Everyone knows that all I wanted was to be off that list. |
Jim Valvano |
We need your help. I need your help. We need money for research. It may not save my life. It may save my children's life. It may save someone you love. And it's very important. |
Chiwetel Ejiofor |
Different people approach the universe in different ways, but they also approach their own expectations in different ways. |
Henry Van Dyke |
Time is too slow for those who wait, too swift for those who fear, too long for those who grieve, too short for those who rejoice, but for those who love, time is eternity. |
|
|
i bet i could scrape the images using scrapebox with free proxies to save on costs. the only reason i used paid proxies for the data is because i want to be sure that it's US data to get US results for each product id. and theyre more reliable
This post is a comment.
|
|
|
|
now that my list of product ids is in the millions and ive used about 40gb of proxy bandwidth scraping maybe 50k pages from that data, i have to carefully weigh out how much i want to spend on proxies (spent about $30) on this experiment that could result in just a simple takedown notice to stop the method. granted i can always reuse and modify this data. but i guarantee if you had a million page site based directly around real ecommerce products you would make good money if it stays up
|
|
|
|
have more than a half million product urls (which is really the hard part with amazon, they make it extremely difficult for scrapers to crawl their entire site). after cleaning up this list and potentially trying to get even more products, i will continue to modify my php scraper, this time with use for amazon. it rotates through proxies and user agents so it has worked well in google maps, yelp,. and your university's student directories, so it should bypass amazons no problem. my scraper nowadays saves all the data into xml so i can import through certain plugins, but also have a super easy way to convert to any form i need. originally my scraper rotated through tor proxies and saved all data directly into mysql, over time i created sql files for importing and now that wordpress is used so extensively and doesnt recieve penalties in the search engine like it used to, i can just throw all the data in there and make as many copies and variations of the sites as i want. and make it loo...
This post is a comment.
|
|
|
|
So I scraped 450k amazon product urls, and now i finally finished writing my scraper and finally kicked it off with some fresh proxies. downloading massive amounts of data and images from the big A hole
|
|
|
|
Citizen Science Task: Come up with a color to match the crayon name!
Procedure:
1. Open up a color picker, for example, https://colorpicker.me/ or https://color.adobe.com/. 2. For each item in the numbered list: read them crayon names in list below and picture the color it describes. 3. Find that color in from your mind on your color picker and aim for high precision. ...
|
|
|
|
Internal Facebook Note: Here Is A “Psychological Trick” To Target Teens
At tbh, we built 15 products during the five years of our company. We had many painful lessons about product development that led us to design a systematic method of launching and testing new apps. The purpose of sharing these tactics is to provide guidance for developing products at Facebook—specifically ones that have not reached product-market fit yet.
1. Create a reproducible process of penetrating communities ...
|
|
|
|
found a list online about 300mb of ASINs. can use these to scrape more. shh. been giving away all my trade secrets on this project. just for u thinklynx.
This post is a comment.
|
|
|
|
Referring a user to amazon through your affiliate link gets you 24 hours to 90 days tracking cookie where you can earn commission on anything the user purchases in the time period from Amazon, the biggest online store in the world. 1 million product pages will bring long tail search traffic and careful analytics will reveal the most promising niches/products which new hyper focused niche sites can be created around.
This post is a comment.
|
|
|
|
Periodic Table Turns 150 Years Old
The Economist tells the story of how French chemist Antoine-Laurent de Lavoisier came to publish the first putatively comprehensive list of chemical elements -- substances incapable of being broken down by chemical reactions into other substances -- known today as the periodic table. It was Lavoisier and his wife Marie-Anne who pioneered the technique of measuring quantitatively what went into and came out of a chemical reaction, as a way of getting to the heart of what such a reaction really is. "Where the story of the periodic table of the elements really starts is debatable," reports The Economist, "but Lavoisier's laboratory is as good a place as any to begin..." Here's an excerpt from the report: ...
|
|
|
|
i was looking for a short list of tips for something specific and found an article, top 3 ways to improve XYZ
they were dumb as follows.
1. find out whats broken 2. fix whats broken 3. master xyz
...
|
|