|
|
|
|
oh yes. also, nowadays i run my scripts from a server or even my localhost through wget and strip out the output i use for testing. another reason i use xml and import into wordpress is that it can manage a database of that size far more efficiently than i can. i tried to make a million page site a long time ago and it took forever to load the data i had put into mysql directly from the scraper |
|
|
|
|
cauz |
May 13, 2017, 11:27 a.m. |
|
|
|
|
|
have more than a half million product urls (which is really the hard part with amazon, they make it extremely difficult for scrapers to crawl their entire site). after cleaning up this list and potentially trying to get even more products, i will continue to modify my php scraper, this time for use with amazon. it rotates through proxies and user agents and has worked well on google maps, yelp, and your university's student directories, so it should bypass amazon's no problem. my scraper nowadays saves all the data into xml so i can import it through certain plugins and also have a super easy way to convert it to any form i need. originally my scraper rotated through tor proxies and saved all data directly into mysql; over time i created sql files for importing, and now that wordpress is used so extensively and doesn't receive penalties in the search engines like it used to, i can just throw all the data in there and make as many copies and variations of the sites as i want. and make it loo...
This post is a comment.
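For anyone wondering what "save to xml, convert to any form later" can look like, here is a minimal sketch in Python (the scraper described above is PHP; this is not that code). The element names `products` and `product`, the field names, and the sample records are all made up for illustration, and the layout is a generic one, not the WXR format that WordPress importers expect.

```python
import csv
import io
import xml.etree.ElementTree as ET


def records_to_xml(records):
    """Serialize a list of flat dicts into one XML string."""
    root = ET.Element("products")  # hypothetical root element name
    for record in records:
        item = ET.SubElement(root, "product")  # hypothetical item element name
        for field, value in record.items():
            # ElementTree escapes characters such as & and < on output.
            ET.SubElement(item, field).text = str(value)
    return ET.tostring(root, encoding="unicode")


def xml_to_csv(xml_text, fieldnames):
    """Convert XML produced by records_to_xml into CSV text."""
    root = ET.fromstring(xml_text)
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=fieldnames, lineterminator="\n")
    writer.writeheader()
    for item in root.findall("product"):
        # A field missing from an item becomes an empty CSV cell.
        writer.writerow({name: item.findtext(name, default="") for name in fieldnames})
    return buffer.getvalue()


# Invented sample data, only to exercise the two functions.
sample = [
    {"id": "A1", "title": "Example Widget", "price": "9.99"},
    {"id": "B2", "title": "Another & Better Widget", "price": "19.50"},
]
xml_text = records_to_xml(sample)
csv_text = xml_to_csv(xml_text, ["id", "title", "price"])
```

Keeping XML as the single intermediate format means each extra output (CSV, SQL, an importer-specific feed) is just one more small converter over the same file.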
|
|
|
|
In the previous post on sqlmap basics we learnt how to use sqlmap to hack a vulnerable web application and fetch the list of databases, tables, columns and data rows. In this post we shall see how to do some simple fingerprinting on the remote database to find valuable information that can be used to assist in further exploitation of a system.
So let's say we have a vulnerable url
http://loca...
This post is a comment.
|
|
|
|
now that my list of product ids is in the millions and i've used about 40gb of proxy bandwidth scraping maybe 50k pages from that data, i have to carefully weigh how much i want to spend on proxies (i've spent about $30) on this experiment that could end in just a simple takedown notice to stop the method. granted, i can always reuse and modify this data. but i guarantee if you had a million page site based directly around real ecommerce products you would make good money if it stays up
|
|
|
|
Today I want to talk about a large DDOS attack that leveraged thousands of unsuspecting WordPress websites as indirect source amplification vectors.
Any WordPress site with Pingback enabled (which is on by default) can be used in DDOS attacks against other sites. Note that XMLRPC is used for pingbacks, trackbacks, remote access via mobile devices and many other features you're likely very fond of. But, it can also be heavily misused like what we are seeing.
The story
...
This post is a comment.
|
|
|
|
Scraping Every Product on Amazon to Make a Million Page Affiliate Site
|
|
|
|
sooo my boss installed a patch for our mail server. instructions said to create a backup. he didn't. it fucked up the whole email server so now none of our clients have email. then he decides he wants to move it to a new server. he can barely figure out how to EXTRACT A ZIP FILE. now he says he has a meeting and i have to fix all this and import it all to a brand new server. sweetnessss
|
|
|
|
Walmart Patents Cart That Reads Your Pulse, Temperature (vice.com) 114
Walmart recently applied to patent biometric shopping handles that would track a shopper's heart rate, palm temperature, grip force, and walking speed. "The patent, titled 'System And Method For A Biometric Feedback Cart Handle' and published August 23, outlines a system where sensors in the cart send data to a server," reports Motherboard. "That server then notifies a store employee to check on individual customers." From the report: Over time, the server can build a database of data compared against store location and stress response, the patent says -- potentially valuable information for store planning. Other uses o...
|
|
|
|
The 'send on enter' key needs to be in the session so that I don't have to click it every time I load a page.
This post is a comment.
|
|
|
|
To perform a man-on-the-side attack, the NSA observes a target's Internet traffic using its global network of covert "accesses" to data as it flows over fiber optic cables or satellites. When the target visits a website that the NSA is able to exploit, the agency's surveillance sensors alert the TURBINE system, which then "shoots" data packets at the targeted computer's IP address within a fraction of a second.
In one man-on-the-side technique, codenamed QUANTUMHAND, the agency disguises itself as a fake Facebook server. When a target attempts to log in to the social media site, the NSA transmits malicious data packets that trick the target's computer into thinking they are being sent from the real Facebook. By concealing its malware within what looks like an ordinary Facebook page, the ...
This post is a comment.
|
|
|
|
Linux.org's DNS Got Hijacked
Linux.org reports: Wednesday afternoon around 5pm EST someone was able to get into the registrar account for our domain and point DNS to another server -- as well as lock us out from changing it. They pointed the domain name to a pretty rude page for most of the evening until Cloudflare stepped in and blocked the domain for us.
After a lot of back and forth with our registrar, we were able to get things back under our control. I'd like to point out that our serve...
|
|