Skip to content

Latest commit

 

History

History
31 lines (20 loc) · 934 Bytes

README.md

File metadata and controls

31 lines (20 loc) · 934 Bytes

htmldump

Dump scraped HTML from MongoDB to a bzip2-compressed tar file archive.

Requirements

  • Python 2.7
  • Dependencies from requirements.txt

Usage

htmldump.py -H opented.org -u USERNAME -p PASSWORD OUTPUT [DOC-RE]

See htmldump.py -h for details.

License

Copyright 2012 Joost Cassee / OpenTED

This program is free software: you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation, either version 3 of the License, or (at your option) any later version.

This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for more details.

You should have received a copy of the GNU General Public License along with this program. If not, see http://www.gnu.org/licenses/.