Skip to content

zhra-jm/trthaber-crawler

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

trthaber-crawler

A crawler that contains news data from trthaber.

Build With

  • sqlite

  • python

How To Use

  • at first you should install virtualenv

apt-get install python3-virtualenv

  • then create a new virtual environment

virtualenv -p python3 venv

  • then we have to activate it

. venv/bin/activate

  • now we got to install the requirements

pip install the requirements

  • first of all we have to create tables:

python3 main.py "create_tables"

  • to run the projects first of all we have to find the links we want to crawl so you have to run:

python3 main.py "get_links"

  • now we got the links in our db now we should crawl the links:

python3 main.py "get_articles"

  • now the crawl is done and you have pure data in case you want to see status run:

python3 main.py "stats"

  • Wish you have enjoyed that!Good Luck:D

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages