
Bug#710584: ITP: mediawiki-mwxml2sql -- Tools to help import MediaWiki XML dumps into database



Package: wnpp
Severity: wishlist
Owner: wpmirrordev <wpmirrordev@gmail.com>

* Package name    : mediawiki-mwxml2sql
  Version         : 0.0.2
  Upstream Author : Ariel T. Glenn <ariel@wikimedia.org>
* URL             : https://gerrit.wikimedia.org/r/#/admin/projects/operations/dumps
* License         : GPL-2.0+
  Programming Lang: C
  Description     : Tools to help import MediaWiki XML dumps into database

 Tools for converting MediaWiki XML dumps into a format that can be rapidly
 loaded into a local instance of MediaWiki. 
 
 o mwxml2sql - convert MediaWiki XML dump file into MySQL INSERT commands 
 
 o sql2txt   - convert MySQL INSERT commands to LOAD DATA INFILE commands 
 
 o sqlfilter - filter MySQL INSERT statements by matching fields against values
 
 The Wikimedia Foundation (WMF) offers Wikipedias and Wiktionaries in nearly
 300 languages.  About once a month, the WMF publishes dump files of each wiki
 in XML format.  The WMF also publishes MediaWiki, software that anyone may use
 to create their own wiki or a local mirror of an existing one.  To build a
 mirror, one must import an XML dump file into a local instance of MediaWiki.
 These tools are faster than those published with MediaWiki, and much faster
 than importDump.php.  Speed is important when mirroring the largest wikis,
 such as enwiki.
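
To illustrate the kind of conversion sql2txt performs (INSERT commands to
data that LOAD DATA INFILE can read), here is a minimal sketch in Python.
It is not the upstream C code and does not reflect its actual behaviour or
options.  The function names insert_to_rows and to_tsv_line are hypothetical,
and the sketch assumes mysqldump-style input (backslash-escaped strings, no
doubled '' quotes) and MySQL's default LOAD DATA format (tab-separated
fields, backslash escapes, \N for NULL).

```python
# Hypothetical sketch: turn one "INSERT ... VALUES (...),(...);" statement
# into tab-separated lines in MySQL's default LOAD DATA INFILE format.
# Assumes mysqldump-style quoting; not the upstream sql2txt implementation.

# Common MySQL string-literal escapes; any other escaped char maps to itself.
UNESCAPE = {"n": "\n", "t": "\t", "r": "\r", "0": "\0"}


def insert_to_rows(statement):
    """Yield one list of field values per (...) tuple; SQL NULL becomes None."""
    body = statement[statement.index("VALUES") + len("VALUES"):]
    row, field = None, []
    in_string = quoted = False
    i = 0
    while i < len(body):
        ch = body[i]
        if in_string:
            if ch == "\\":                      # backslash escape inside a string
                nxt = body[i + 1]
                field.append(UNESCAPE.get(nxt, nxt))
                i += 1
            elif ch == "'":                     # closing quote
                in_string = False
            else:
                field.append(ch)
        elif ch == "'":                         # opening quote
            in_string = quoted = True
        elif ch == "(":                         # start of a new tuple
            row, field, quoted = [], [], False
        elif ch in ",)" and row is not None:    # end of a field
            text = "".join(field)
            row.append(None if (text == "NULL" and not quoted) else text)
            field, quoted = [], False
            if ch == ")":                       # end of the tuple
                yield row
                row = None
        elif row is not None and not ch.isspace():
            field.append(ch)                    # unquoted token (number, NULL)
        i += 1


def to_tsv_line(row):
    """Encode one row using the default LOAD DATA escapes."""
    out = []
    for value in row:
        if value is None:
            out.append("\\N")                   # \N is read back as NULL
        else:
            out.append(value.replace("\\", "\\\\")
                            .replace("\t", "\\t")
                            .replace("\n", "\\n"))
    return "\t".join(out)


if __name__ == "__main__":
    stmt = "INSERT INTO page VALUES (1,0,'Main_Page',NULL),(2,0,'It\\'s',NULL);"
    for r in insert_to_rows(stmt):
        print(to_tsv_line(r))
```

Loading pre-split rows in bulk avoids parsing one large SQL statement per
batch, which is presumably part of why this route is faster for big dumps.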

