Wikipedia import

Script for importing Wikipedia pages

The script /scripts/db_updates/import_from_wikipedia_pages.py imports Wikipedia objects from a specified directory into the database (collection = imported_objects). It takes two parameters: the name of the database and the directory containing the Wikipedia pages.

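Example of running the script: python import_from_wikipedia_pages.py lido import_dir
Here lido is the database name and import_dir is the directory with the Wikipedia pages.
For every Wikipedia page, the script outputs the full object that will be stored in the database (collection = imported_objects).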
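
As a rough illustration of the flow described above, the following Python sketch walks a directory, builds one object per page file, prints it, and hands it to a storage callback. The object fields (title, source, text), the function names, and the storage callback are assumptions made for this example; the actual script's object schema and database calls are not documented on this page.

```python
import json
from pathlib import Path


def build_object(page_path):
    """Turn one saved page file into a dict (hypothetical schema)."""
    text = page_path.read_text(encoding="utf-8")
    return {"title": page_path.stem, "source": "wikipedia", "text": text}


def import_pages(import_dir, store):
    """Build an object for every file in import_dir, print it, and store it.

    `store` is any callable taking one dict. In the real script this role is
    presumably played by an insert into the imported_objects collection.
    """
    objects = []
    for page_path in sorted(Path(import_dir).iterdir()):
        if not page_path.is_file():
            continue  # skip subdirectories
        obj = build_object(page_path)
        # Output the full object, as the script is described to do.
        print(json.dumps(obj, ensure_ascii=False))
        store(obj)
        objects.append(obj)
    return objects
```

For a quick dry run without a database, a plain list can stand in for the collection: `stored = []` followed by `import_pages("import_dir", stored.append)`.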

Online-museum-crawler (https://bitbucket.org/osll/museums-crawlers/overview)

Two new modes:

  • Save a specified wiki page by title. Keys: -p -t "page title" -L "ru/en" (language: ru or en). For example: java -jar ./build/jar/online-museum-crawler -p -t "Агеев, Александр Владимирович" -L "ru"
  • Save all pages from a category and all of its subcategories. Keys: -n -C "category name" -L "ru/en" (language: ru or en). For example: java -jar ./build/jar/online-museum-crawler -n -C "Республика Карелия" -L "ru"
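
When the crawler is driven from a Python script, the two modes above can be wrapped in small helpers that build the argument list. This is only a sketch: the helper names are hypothetical, the jar path is copied as written in the examples, and only the keys listed above are used.

```python
import subprocess

# Path copied from the examples above; adjust it to the local build output.
CRAWLER_JAR = "./build/jar/online-museum-crawler"
LANGUAGES = ("ru", "en")


def _check_language(lang):
    if lang not in LANGUAGES:
        raise ValueError("language must be 'ru' or 'en', got %r" % (lang,))


def page_command(title, lang):
    """Arguments for saving a single wiki page by title (-p -t ... -L ...)."""
    _check_language(lang)
    return ["java", "-jar", CRAWLER_JAR, "-p", "-t", title, "-L", lang]


def category_command(category, lang):
    """Arguments for saving a category and its subcategories (-n -C ... -L ...)."""
    _check_language(lang)
    return ["java", "-jar", CRAWLER_JAR, "-n", "-C", category, "-L", lang]


def run_crawler(command):
    """Run the crawler; raises CalledProcessError on a non-zero exit code."""
    return subprocess.run(command, check=True)
```

Passing the arguments as a list avoids shell quoting problems with titles that contain spaces or commas, for example `run_crawler(page_command("Агеев, Александр Владимирович", "ru"))`.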