2007-08-31

Crazy idea of the day

Today one crazy idea cross my mind: if there is a site that use some sort of query to provide us with information, then I can use a little proggye to create a list of links to all the possible queries and then forcing the download using a robot like httrack. In this way, there is the possibility to pull out the data from some databases. The problem that arise is that your download may be considered an intrusion or an attack (like a dos attack). From what concerns me, it is not illegal, because when someone put something publicly available on the net, everyone can access that information, therefore if I want everything then that's those servers for, right? In fact Google spiders do the same thing: read whatever they can access.

Terms used:
  • query: a request for information, usually to a search engine or a database
  • proggye: a little program that usually doesn't have a visual interface
  • httrack: a program that allows you to download a World Wide Web site from the Internet to a local directory, building recursively all directories, getting HTML, images, and other files from the server to your computer; can be found here
  • dos attack: it means denial of service attack and it occurs when something or someone requests a huge amount of information in a limited period of time, in this way making the service unvailable to others or slowing down the service
  • spider: a special program that crawls the web for information

Niciun comentariu: