Last modified: 2012-06-20 10:26:45 UTC

Wikimedia Bugzilla is closed!

Wikimedia migrated from Bugzilla to Phabricator. Bug reports are handled in Wikimedia Phabricator.
This static website is read-only and for historical purposes. It is not possible to log in and except for displaying bug reports and their history, links might be broken. See T39242, the corresponding Phabricator task for complete and up-to-date bug report information.
Bug 37242 - English Wikipedia xml dump Category Stomach missing
Status: RESOLVED INVALID
Product: Datasets
Classification: Unclassified
Component: General/Unknown
Version: unspecified
Hardware: All All
Importance: Unprioritized normal
Target Milestone: ---
Assigned To: Ariel T. Glenn
Reported: 2012-05-31 11:16 UTC by Akshaya ATM
Modified: 2012-06-20 10:26 UTC

Description Akshaya ATM 2012-05-31 11:16:07 UTC
In wikipedia XML dump downloaded, the pages in the category stomach (e.g. cardia, stomach) are missing the category tag, although they are categorized in the online resource. Also, no Category:Stomach page exists in the English Wikipedia XML dump.

BUG: MISSING CATEGORY stomach TAG in all the pages belonging to that category

example of pages missing the category: cardia, stomach.

NO category:stomach page available in english wikipedia xml dump.
Comment 1 Andre Klapper 2012-06-02 16:30:04 UTC
(In reply to comment #0)
> In wikipedia XML dump downloaded

Exact URL welcome to reproduce.
Comment 2 Akshaya ATM 2012-06-03 08:34:17 UTC
download.wikimedia.org/enwiki/latest/enwiki-latest-pages-articles.xml.bz2
This is the URL of the dump.
Thanks.
Comment 3 Ariel T. Glenn 2012-06-20 10:26:45 UTC
The category was added on May 12 2012: see http://en.wikipedia.org/w/index.php?title=Category:Stomach&action=history

The dump was generated on May 2: see http://dumps.wikimedia.org/enwiki/20120502/ for date and timestamps of all files.

The new dumps have the category in them as expected.
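
To check a report like this against a local copy of a dump, something along the following lines could be used. It is only a minimal sketch, not part of the dump tooling: the names `scan_dump`, `scan_bz2_file` and `local_name` are made up for illustration, and it assumes the usual pages-articles layout of `<page>` elements with a `<title>` and a revision `<text>`. The category match is a simplification (fully case-insensitive, no handling of templates that add categories).

```python
import bz2
import re
import xml.etree.ElementTree as ET


def local_name(tag):
    """Drop the '{namespace}' prefix that ElementTree puts on tag names."""
    return tag.rsplit("}", 1)[-1]


def scan_dump(stream, category):
    """Scan an export XML stream.

    Returns (category_page_found, titles_of_pages_tagged_with_category).
    """
    # Matches [[Category:Name]] and [[Category:Name|sort key]] in wikitext.
    link = re.compile(
        r"\[\[\s*Category\s*:\s*" + re.escape(category) + r"\s*(\|[^\]]*)?\]\]",
        re.IGNORECASE,
    )
    category_title = "Category:" + category
    page_found = False
    tagged = []
    # iterparse streams the file, so the whole dump is never parsed at once.
    for _, elem in ET.iterparse(stream, events=("end",)):
        if local_name(elem.tag) != "page":
            continue
        title = None
        text = ""
        for child in elem.iter():
            name = local_name(child.tag)
            if name == "title":
                title = child.text
            elif name == "text":
                text = child.text or ""
        if title == category_title:
            page_found = True
        if link.search(text):
            tagged.append(title)
        elem.clear()  # drop the page's children once it has been inspected
    return page_found, tagged


def scan_bz2_file(path, category):
    """Convenience wrapper for a local .xml.bz2 dump file (path is the caller's)."""
    with bz2.open(path, "rb") as stream:
        return scan_dump(stream, category)
```

Calling `scan_bz2_file` on a downloaded pages-articles file with the category name `"Stomach"` would report whether a Category:Stomach page is present and which pages carry the tag. An empty result on a dump generated before the category was created is expected, as explained above.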
