Diffstat (limited to 'README.md')
-rw-r--r-- | README.md | 13 |
1 files changed, 0 insertions, 13 deletions
diff --git a/README.md b/README.md
deleted file mode 100644
index 9cdc510..0000000
--- a/README.md
+++ /dev/null
@@ -1,13 +0,0 @@
-# Introduction
-This crawler gets all important information and all links of a website and writes the links to a queue.
-After it has finished the information gathering, it will go on by using the first url of the queue and it will start again.
-
-# Using the crawler
-1. Create a mysql database: `mysql -u username -p` and `CREATE DATABASE database_name;`
-2. Import the `database.sql` file into your database with `mysql -u username -p database_name < database.sql`
-3. Edit `database/mysql_conf.inc` according to your databases credentials
-4. Run `cd crawler && php crawler.php http://dmoztools.net/` (or any other domain)
-5. For future runs, just execute `cd crawler && php crawler.php` without any arguments and it will automatically
-   start with the first url of the queue
-6. Finished!
-
\ No newline at end of file
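
For reference, the queue-driven loop that the removed README describes (gather a page's links, append them to a queue, continue with the first queued URL) can be sketched roughly as follows. This is only an illustration in Python, not the project's PHP code: the in-memory `deque`, the `seen` set, the `max_pages` limit, the `fake_fetch` helper and the `example.test` pages are all assumptions made for the example, whereas the real crawler keeps its queue in a MySQL database.

```python
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urljoin


class LinkExtractor(HTMLParser):
    """Collects the href of every <a> tag in a page."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)


def crawl(start_url, fetch, max_pages=10):
    """Visit pages in queue order, feeding discovered links back into the queue."""
    queue = deque([start_url])
    seen = {start_url}          # assumption: each URL is queued only once
    visited = []
    while queue and len(visited) < max_pages:
        url = queue.popleft()   # continue with the first URL of the queue
        html = fetch(url)
        visited.append(url)
        parser = LinkExtractor()
        parser.feed(html)
        for href in parser.links:
            link = urljoin(url, href)  # resolve relative links against the page URL
            if link not in seen:
                seen.add(link)
                queue.append(link)
    return visited


# Hypothetical in-memory "site" standing in for real HTTP requests.
PAGES = {
    "http://example.test/": '<a href="/a">A</a> <a href="/b">B</a>',
    "http://example.test/a": '<a href="/b">B</a>',
    "http://example.test/b": '<a href="/">home</a>',
}


def fake_fetch(url):
    """Return the stored HTML for a URL, or an empty page if unknown."""
    return PAGES.get(url, "")


order = crawl("http://example.test/", fake_fetch)
print(order)
```

Passing the fetch function in as a parameter keeps the loop testable without network access; a persistent queue, as in the original crawler, would replace the `deque` with database reads and writes.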