|
|
|
|
i'm always deleting my data though |
|
|
|
|
|
|
Charles Babbage |
Errors using inadequate data are much less than those using no data at all. |
Stephen Cambone |
There is a reasonable concern that posting raw data can be misleading for those who are not trained in its use and who do not have the broader perspective within which to place a particular piece of data that is raw. |
Sarah Caldwell |
If you approach an opera as though it were something that always went a certain way, that's what you get. I approach an opera as though I didn't know it. |
Anita Elberse |
I spend way too much time watching television, going to sports games, going to movies. It struck me that there's an awful lot of data in the public domain for these sectors. The movie industry publishes weekly sales numbers - not many industries do. |
Arthur Conan Doyle |
It is a capital mistake to theorize before one has data. |
Bernard Ebbers |
Our communications services revenue growth is being driven by continued strong top-line performance in data, Internet and international - three of the fastest growing and most profitable areas within communications services. |
Bernard Ebbers |
Our investments in data, Internet and international have been particularly timely and have positioned the company to post industry-leading incremental revenue gains. |
Bruce Jackson |
The key fact missed most often by social scientists utilizing documentary films for data, is this: documentary films are not found or reported things; they're made things. |
Colin Powell |
Experts often possess more data than judgment. |
Frank Gaffney |
If the area were on or near the U.S. continental shelf, such data could well provide an enemy with strategically invaluable insights into undersea access routes that could be used to attack some of the millions of Americans who live on or near our coasts. |
|
|
Maybe I should make some youtube videos. I would make one about the real concerns of AI, one about basic data science and data analysis, and one that is an introduction to neural networks.
|
|
|
|
i bet i could scrape the images using scrapebox with free proxies to save on costs. the only reason i used paid proxies for the data is because i want to be sure that it's US data to get US results for each product id. and theyre more reliable
This post is a comment.
|
|
|
|
To perform a man-on-the-side attack, the NSA observes a target?s Internet traffic using its global network of covert ?accesses? to data as it flows over fiber optic cables or satellites. When the target visits a website that the NSA is able to exploit, the agency?s surveillance sensors alert the TURBINE system, which then ?shoots? data packets at the targeted computer?s IP address within a fraction of a second.
In one man-on-the-side technique, codenamed QUANTUMHAND, the agency disguises itself as a fake Facebook server. When a target attempts to log in to the social media site, the NSA transmits malicious data packets that trick the target?s computer into thinking they are being sent from the real Facebook. By concealing its malware within what looks like an ordinary Facebook page, the ...
This post is a comment.
|
|
|
|
the old captchas where you typed in strange words were part of a larger book translation scheme. you were typing in what you thought these physical books words were and it compared them to their data or other peoples data to decipher them in pieces. i thought this was for google books but not sure
This post is a comment.
|
|
|
|
now that my list of product ids is in the millions and ive used about 40gb of proxy bandwidth scraping maybe 50k pages from that data, i have to carefully weigh out how much i want to spend on proxies (spent about $30) on this experiment that could result in just a simple takedown notice to stop the method. granted i can always reuse and modify this data. but i guarantee if you had a million page site based directly around real ecommerce products you would make good money if it stays up
|
|
|
|
Amazon Opens Up Its Internal Machine Learning Training To Everyone
Amazon announced Monday that it's making the machine learning courses it uses to train its engineers available to everybody for free. The course is tailored to four major groups -- developers, data scientists, data platform engineers and business professionals -- and it offers both foundational level lessons as well as more advanced instruction.
https://aws.amazon.com/blogs/machine-learning/amazons-own-machine-learning-unive...
|
|
|
|
Netflix Has Saved Every Choice You've Ever Made In 'Black Mirror: Bandersnatch'
According to a technology policy researcher, Netflix records all the choices you make in Black Mirror's Bandersnatch episode. "Michael Veale, a technology policy researcher at University College London, wanted to know what data Netflix was collecting from Bandersnatch," reports Motherboard. "People had been speculating a lot on Twitter about Netflix's motivations," Veale told Motherboard in an email. "I thought it would be a fun test to show people how you can use data protection law to ask real questions you have." From the report: The law Veale used is Europe's General Data Protection Regulation (GDPR). The ...
|
|
|
|
have more than a half million product urls (which is really the hard part with amazon, they make it extremely difficult for scrapers to crawl their entire site). after cleaning up this list and potentially trying to get even more products, i will continue to modify my php scraper, this time with use for amazon. it rotates through proxies and user agents so it has worked well in google maps, yelp,. and your university's student directories, so it should bypass amazons no problem. my scraper nowadays saves all the data into xml so i can import through certain plugins, but also have a super easy way to convert to any form i need. originally my scraper rotated through tor proxies and saved all data directly into mysql, over time i created sql files for importing and now that wordpress is used so extensively and doesnt recieve penalties in the search engine like it used to, i can just throw all the data in there and make as many copies and variations of the sites as i want. and make it loo...
This post is a comment.
|
|
|
|
Senator Introduces Bill That Would Send CEOs To Jail For Violating Consumer Privacy
Oregon Senator Ron Wyden has introduced the Consumer Data Protection Act that "would dramatically beef up Federal Trade Commission authority and funding to crack down on privacy violations, let consumers opt out of having their sensitive personal data collected and sold, and impose harsh new penalties on a massive data monetization industry that has for years claims that self-regulation is all that's necessary to protect consumer privacy," reports Motherboard. From the report: Wyden's bill proposes that companies whose revenue exceeds $1 billion per year -- or warehouse data on more than 50 million consume...
|
|
|
|
Qualtrics doesn't let you force a response on all questions. Pretty lame, right? What am I supposed to do when I generate a survey for crowd-sourcing but I don't want to manually go through hundreds of questions to add validation?
Well, here is the hacky solution. 1. Set validation on one question. 2. Export the survey. This downloads the survey as a QSF file. 3. Open QSF file in a text editor and find the validation you set for that one question. ...
|
|