Last modified: 2014-11-16 00:09:25 UTC

Wikimedia Bugzilla is closed!

Wikimedia migrated from Bugzilla to Phabricator. Bug reports are handled in Wikimedia Phabricator.
This static website is read-only and for historical purposes. It is not possible to log in and except for displaying bug reports and their history, links might be broken. See T39591, the corresponding Phabricator task for complete and up-to-date bug report information.
Bug 37591 - History of a page was lost after several move actions on zhwp
Status: NEW
Product: Wikimedia
Classification: Unclassified
Component: General/Unknown
Version: wmf-deployment
Hardware: All All
Importance: Low critical with 1 vote
Target Milestone: ---
Assigned To: Nobody - You can work on this!
URL: https://zh.wikipedia.org/w/index.php?...
Depends on: 39007 39008
Blocks:
Reported: 2012-06-14 13:59 UTC by shi zhao
Modified: 2014-11-16 00:09 UTC (History)
CC List: 13 users

Description shi zhao 2012-06-14 13:59:00 UTC
After moving the page, all of its history was lost on zh.wp. See:

https://zh.wikipedia.org/w/index.php?title=Wikipedia:%E7%89%B9%E8%89%B2%E6%9D%A1%E7%9B%AE%E5%80%99%E9%80%89&action=history

https://zh.wikipedia.org/w/index.php?title=Wikipedia:%E7%89%B9%E8%89%B2%E6%9D%A1%E7%9B%AE%E8%AF%84%E9%80%89&action=history

The page is the FAC page on zh.wp; so far only 3 revisions have been restored.

Please help fix it!
Comment 1 Peter Bena 2012-06-14 14:02:09 UTC
This bug may need to be solved at the DB level (+shell).
Comment 2 Liangent 2012-06-14 14:10:08 UTC
Here are some checks on Toolserver (which has a large replication lag):

$ sql zhwiki_p
Reading table information for completion of table and column names
You can turn off this feature to get a quicker startup with -A

Welcome to the MySQL monitor.  Commands end with ; or \g.
Your MySQL connection id is 88023986
Server version: 5.1.53 Source distribution

Copyright (c) 2000, 2010, Oracle and/or its affiliates. All rights reserved.

Oracle is a registered trademark of Oracle Corporation and/or its
affiliates. Other names may be trademarks of their respective
owners.

Type 'help;' or '\h' for help. Type '\c' to clear the current input statement.

mysql> select * from page where page_title='特色条目候选' and page_namespace=4;
+---------+----------------+--------------------+-------------------+--------------+------------------+-------------+----------------+----------------+-------------+----------+
| page_id | page_namespace | page_title         | page_restrictions | page_counter | page_is_redirect | page_is_new | page_random    | page_touched   | page_latest | page_len |
+---------+----------------+--------------------+-------------------+--------------+------------------+-------------+----------------+----------------+-------------+----------+
|   46728 |              4 | 特色条目候选 |                   |            0 |                0 |           0 | 0.464639958211 | 20120606104339 |    21416960 |     3421 |
+---------+----------------+--------------------+-------------------+--------------+------------------+-------------+----------------+----------------+-------------+----------+
1 row in set (0.01 sec)

mysql> select * from revision where rev_page=46728 order by rev_id desc limit 10;
+----------+----------+-------------+-----------------------------------------------------------+----------+---------------+----------------+----------------+-------------+---------+---------------+---------------------------------+
| rev_id   | rev_page | rev_text_id | rev_comment                                               | rev_user | rev_user_text | rev_timestamp  | rev_minor_edit | rev_deleted | rev_len | rev_parent_id | rev_sha1                        |
+----------+----------+-------------+-----------------------------------------------------------+----------+---------------+----------------+----------------+-------------+---------+---------------+---------------------------------+
| 21416960 |    46728 |    21613695 | consistency                                               |   775182 | Cravix        | 20120531141052 |              1 |           0 |    3421 |      21345461 | 0cu0xuhfnyfcgsfwowsww8qg7byyg6m |
| 21345461 |    46728 |    21539076 | /* 提名區 */                                           |   642079 | Lixihan       | 20120525092802 |              0 |           0 |    3363 |      21269220 | 04oty0im3wcbdarcd804v3ziwci7hbf |
| 21269220 |    46728 |    21458630 |                                                           |   522281 | Waihorace     | 20120519061839 |              0 |           0 |    3324 |      21269208 | tlo1y3elpf11bw27y911rcvp9bwdp0v |
| 21269208 |    46728 |    21458618 |                                                           |   522281 | Waihorace     | 20120519061755 |              0 |           0 |    3142 |      20815355 | 3s3656gdzau9xach8ppqbd7bs4qz7kd |
| 20815355 |    46728 |    20986838 |                                                           |  1047411 | 风雨同舟  | 20120419084310 |              0 |           0 |    3141 |      20813643 | c7gul0l1nnqnwlt6u3nx2j9zktix8ro |
| 20813643 |    46728 |    20984884 |                                                           |   441687 | Ai6z83xl3g    | 20120419054402 |              0 |           0 |    3184 |      19822979 | 6c9ow3fbiwbx62tdbmpo9nibbdw9iap |
| 19822979 |    46728 |    19976799 |                                                           |   441687 | Ai6z83xl3g    | 20120405051116 |              0 |           0 |    3233 |      19807560 | pn0lkty2amgxmwu6pavsu5jw9keol65 |
| 19807560 |    46728 |    19960395 | {{Wikipedia:特色條目候選/苏州市}}:進行評選 |   959311 | Mouse20080706 | 20120403231200 |              0 |           0 |    3184 |      19804982 | 6c9ow3fbiwbx62tdbmpo9nibbdw9iap |
| 19804982 |    46728 |    19957728 |                                                           |  1106366 | Huhaoyu321    | 20120403175655 |              0 |           0 |    3142 |      19804894 | 201a73ud6onvuoqo080az0bqwznguqn |
| 19804894 |    46728 |    19957638 |                                                           |  1106366 | Huhaoyu321    | 20120403175211 |              0 |           0 |    3184 |      18126033 | 6c9ow3fbiwbx62tdbmpo9nibbdw9iap |
+----------+----------+-------------+-----------------------------------------------------------+----------+---------------+----------------+----------------+-------------+---------+---------------+---------------------------------+
10 rows in set (0.02 sec)

mysql> Bye

$ curl "http://zh.wikipedia.org/w/index.php?oldid=19804894&action=render&uselang=en"
<p>The database did not find the text of a page that it should have found, named "Wikipedia:首页" (revision#: 19804894).</p>
<p>This is usually caused by following an outdated diff or history link to a page that has been deleted.</p>
<p>If this is not the case, you may have found a bug in the software. Please report this to an <a href="//zh.wikipedia.org/wiki/Special:%E7%94%A8%E6%88%B7%E5%88%97%E8%A1%A8/sysop" title="Special:用户列表/sysop">administrator</a>, making note of the URL.</p>


<!-- 
NewPP limit report
Preprocessor node count: 1/1000000
Post-expand include size: 0/2048000 bytes
Template argument size: 0/2048000 bytes
Highest expansion depth: 1/40
Expensive parser function count: 0/500
-->
Comment 3 Liangent 2012-06-14 14:10:56 UTC
Someone should check where these rev_ids and rev_sha1s went, either on the WMF cluster or once Toolserver catches up.
Comment 4 Peter Bena 2012-06-14 14:18:36 UTC
What is the name of the original page?
Comment 5 Liangent 2012-06-14 14:20:29 UTC
(In reply to comment #4)
> What is name of original page

See my SQL above.
Comment 6 Liangent 2012-06-14 14:21:53 UTC
These are just some notes.

mysql> select * from page where page_title='特色条目评选' and page_namespace=4;
+---------+----------------+--------------------+-------------------+--------------+------------------+-------------+----------------+----------------+-------------+----------+
| page_id | page_namespace | page_title         | page_restrictions | page_counter | page_is_redirect | page_is_new | page_random    | page_touched   | page_latest | page_len |
+---------+----------------+--------------------+-------------------+--------------+------------------+-------------+----------------+----------------+-------------+----------+
|  291154 |              4 | 特色条目评选 |                   |            0 |                1 |           0 | 0.429395508603 | 20120531141052 |     2353742 |       42 |
+---------+----------------+--------------------+-------------------+--------------+------------------+-------------+----------------+----------------+-------------+----------+
1 row in set (0.02 sec)

mysql> select * from revision where rev_page=291154 limit 10;
+---------+----------+-------------+------------------------------------------------------------------------------+----------+-----------------+----------------+----------------+-------------+---------+---------------+---------------------------------+
| rev_id  | rev_page | rev_text_id | rev_comment                                                                  | rev_user | rev_user_text   | rev_timestamp  | rev_minor_edit | rev_deleted | rev_len | rev_parent_id | rev_sha1                        |
+---------+----------+-------------+------------------------------------------------------------------------------+----------+-----------------+----------------+----------------+-------------+---------+---------------+---------------------------------+
| 1977486 |   291154 |     1955773 | [[Wikipedia:特色条目评选]]已移动到[[Wikipedia:特色条目候选]] |    23701 | 百家姓之四 | 20060523035902 |              0 |           0 |      42 |             0 | ek74pn6yc6sdzdk4q0kmxmbhnoa8g02 |
| 2353701 |   291154 |     2317697 | 重定向到[[Wikipedia:特色條目候選]]                                 |    23701 | 百家姓之四 | 20060802021731 |              1 |           0 |      42 |       1977486 | sk2aazj3a537cmwxq54jox860brntz7 |
| 2353742 |   291154 |     2317736 | 重定向到[[Wikipedia:特色条目候选]]                                 |    23701 | 百家姓之四 | 20060802022403 |              1 |           0 |      42 |       2353701 | ek74pn6yc6sdzdk4q0kmxmbhnoa8g02 |
+---------+----------+-------------+------------------------------------------------------------------------------+----------+-----------------+----------------+----------------+-------------+---------+---------------+---------------------------------+
3 rows in set (0.04 sec)

mysql>
Comment 7 shi zhao 2012-06-14 14:25:31 UTC
(In reply to comment #4)
> What is name of original page

The original page is Wikipedia:特色条目候选
Comment 8 Liangent 2012-06-14 14:31:25 UTC
shizhao, can you describe your exact operating procedure?
Comment 9 Peter Bena 2012-06-14 14:45:11 UTC
When did this problem happen? After the last move at 12:56, 14 June 2012, or on some other date? Was it after this move:

12:56, 14 June 2012 Shizhao moved page Wikipedia:特色条目候选 to Wikipedia:特色条目评选 (开始改版)

I can't understand why there is a page which redirects to itself.
Comment 10 shi zhao 2012-06-14 14:46:24 UTC
(In reply to comment #8)
> shizhao, can you describe your exact operating procedure?

1. Move "wikipedia:特色条目候选" -> "wikipedia:特色条目评选".
2. Moving.....
....
....
3. Server error page: "We have a technical problem.....blablabla"
4. Wait a few seconds.
5. Close the tab.
6. Open the page "wikipedia:特色条目评选"; it is now just a redirect.
7. Restore the history revisions.
8. End.
Comment 11 shi zhao 2012-06-14 14:50:01 UTC
PS:

4. Wait a few seconds.
4.1 Refresh the page.
4.2 Wait a few more seconds, but there is no response.
5. Close the tab.
Comment 12 Peter Bena 2012-06-14 14:52:44 UTC
This is indeed a bug. The move wasn't done using the MediaWiki interface but with some tool; that user made many moves in the very same minute (more than 50). The first one was this page and, as you can see, every move consists of two log actions:

1. minor edit on target page, where the page is moved
2. new page creation on original page (redirect string)

In this case only the second occurred. So I guess if you delete the page (original name) you should be able to restore the previous version, which contains the full history. In fact the page was never moved, but overwritten with a redirect.
Comment 13 Peter Bena 2012-06-14 14:55:24 UTC
It's even more complicated: the page was moved using the interface, but the number of subpages moved together with it was so large that MediaWiki probably hit the execution timeout and left the move unfinished.
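
To illustrate how a timeout in the middle of a large move could leave things half done, here is a minimal sketch. It is not MediaWiki's actual move code: the one-column `page` table, the `move_with_subpages` helper, the `SimulatedTimeout` exception and the "FAC"/"FAR" titles are all assumptions made for the example, and the database is an in-memory SQLite one. It only contrasts a rename loop that commits each step with one wrapped in a single transaction.

```python
import sqlite3


class SimulatedTimeout(Exception):
    """Stands in for an execution timeout (an assumption for this sketch)."""


def make_db():
    # isolation_level=None means autocommit; transactions are opened explicitly.
    db = sqlite3.connect(":memory:", isolation_level=None)
    db.execute("CREATE TABLE page (page_id INTEGER PRIMARY KEY, page_title TEXT UNIQUE)")
    seed = ["FAC", "FAC/A", "FAC/B", "FAC/C"]  # hypothetical page plus subpages
    db.executemany("INSERT INTO page (page_title) VALUES (?)", [(t,) for t in seed])
    return db


def move_with_subpages(db, old, new, atomic, fail_after=None):
    """Rename `old` and its subpages to `new`, optionally failing partway through."""
    rows = db.execute(
        "SELECT page_id, page_title FROM page "
        "WHERE page_title = ? OR page_title LIKE ? ORDER BY page_id",
        (old, old + "/%"),
    ).fetchall()
    if atomic:
        db.execute("BEGIN")
    try:
        for done, (page_id, title) in enumerate(rows):
            if fail_after is not None and done == fail_after:
                raise SimulatedTimeout()
            db.execute(
                "UPDATE page SET page_title = ? WHERE page_id = ?",
                (new + title[len(old):], page_id),
            )
        if atomic:
            db.execute("COMMIT")
    except SimulatedTimeout:
        if atomic:
            db.execute("ROLLBACK")  # undo the renames done so far
        raise


def titles(db):
    return [t for (t,) in db.execute("SELECT page_title FROM page ORDER BY page_id")]


def run(atomic):
    """Attempt the move, 'time out' after two renames, and report the final titles."""
    db = make_db()
    try:
        move_with_subpages(db, "FAC", "FAR", atomic=atomic, fail_after=2)
    except SimulatedTimeout:
        pass
    return titles(db)


print(run(atomic=False))
print(run(atomic=True))
```

In the non-atomic run two of the four titles are renamed and two are not; in the atomic run all four keep their old titles. Whether this is what actually happened here is a guess, not something the log confirms.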
Comment 14 shi zhao 2012-06-14 14:59:24 UTC
(In reply to comment #13)
> it's even more complicated, the page was moved using the interface, however the
> number of subpages which were moved together was so large that mw probably
> crashed on execution timeout and left it unfinished

maybe
Comment 15 Peter Bena 2012-06-14 15:22:55 UTC
The last revision on that page was 21416960, text 21613695.

Selecting this data from the database would help you recover the content of the latest revision.
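
As a rough sketch of that lookup, the following joins a revision to its text row and reports where the content lives. It runs against an in-memory SQLite copy with a reduced schema (only the columns needed), the helper names `locate_text` and `describe` are made up for the example, and the sample values are copied from the rows quoted elsewhere in this report. It does not fetch or decompress anything.

```python
import sqlite3


def locate_text(db, rev_id):
    """Return (text_id, flags, payload) for a revision, or None if nothing joins."""
    row = db.execute(
        'SELECT t.old_id, t.old_flags, t.old_text '
        'FROM revision AS r JOIN "text" AS t ON t.old_id = r.rev_text_id '
        'WHERE r.rev_id = ?',
        (rev_id,),
    ).fetchone()
    if row is None:
        return None
    text_id, flags, payload = row
    return text_id, set(flags.split(",")), payload


def describe(result):
    """Summarise a locate_text() result in one line."""
    if result is None:
        return "no revision/text row found"
    text_id, flags, payload = result
    if "external" in flags:
        # The payload is an address into external storage, not the wikitext itself.
        return f"text {text_id} is stored externally at {payload}"
    return f"text {text_id} is stored inline ({len(payload)} characters)"


# Reduced, assumed schema; values taken from the queries in this report.
db = sqlite3.connect(":memory:")
db.execute("CREATE TABLE revision (rev_id INTEGER PRIMARY KEY, rev_page INTEGER, rev_text_id INTEGER)")
db.execute('CREATE TABLE "text" (old_id INTEGER PRIMARY KEY, old_text TEXT, old_flags TEXT)')
db.execute("INSERT INTO revision VALUES (?, ?, ?)", (21416960, 46728, 21613695))
db.execute(
    'INSERT INTO "text" VALUES (?, ?, ?)',
    (21613695, "DB://cluster23/398889", "utf-8,gzip,external"),
)

print(describe(locate_text(db, 21416960)))
```

With the `external` flag set, the text row only holds a pointer, so the actual content has to come from the storage cluster it names rather than from this table.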
Comment 16 shi zhao 2012-06-14 15:36:03 UTC
I have temporarily rewritten the page "wikipedia:特色条目评选". The history revisions have not been restored.
Comment 17 shi zhao 2012-06-14 15:38:06 UTC
(In reply to comment #15)
> Last revision on that page was 21416960, text 21613695
> 
> selecting this data from database would help you recover the content of latest
> revision

I can't get the page content (text 21613695) from the Toolserver database.
Comment 18 Liangent 2012-06-14 18:55:25 UTC
(In reply to comment #10)
> (In reply to comment #8)
> > shizhao, can you describe your exact operating procedure?
> 
> 1. move "wikipedia:特色条目候选" -> "wikipedia:特色条目评选"
> 2. moving.....
> ....
> ....
> 3.server error page, "We have a technical problem.....blablabla"
> 4.wait few seconds
> 5.closed the tab.
> 6.open page "wikipedia:特色条目评选", now it just a redirect
> 7.restore history revision
> 8 end

Let's expand it in detail:

0. There was a page Wikipedia:特色条目评选 which was a redirect to Wikipedia:特色条目候选
1a. You try to move Wikipedia:特色条目候选 to Wikipedia:特色条目评选
1b. The system gives you a warning saying that Wikipedia:特色条目评选 already exists and provides you a checkbox to delete it before moving
1c. You tick the checkbox and submit the form again.

Is this correct?

6. When you check Wikipedia:特色条目评选, it's a redirect to Wikipedia:特色条目候选 (the same as this page before moving? did you check page history and what did it say?)
7. Which title did you do this operation on? Does "restore history revision" (only one? what's it?) mean undeleting an archive (=deleted in usual words) revision? If yes, are there any revisions in action=history before undeletion?
Comment 19 shi zhao 2012-06-14 19:20:54 UTC
(In reply to comment #18)
> (In reply to comment #10)
> > (In reply to comment #8)
> > > shizhao, can you describe your exact operating procedure?
> > 
> > 1. move "wikipedia:特色条目候选" -> "wikipedia:特色条目评选"
> > 2. moving.....
> > ....
> > ....
> > 3.server error page, "We have a technical problem.....blablabla"
> > 4.wait few seconds
> > 5.closed the tab.
> > 6.open page "wikipedia:特色条目评选", now it just a redirect
> > 7.restore history revision
> > 8 end
> 
> Let's expand it in detail:
> 
> 0. There was a page Wikipedia:特色条目评选 which was a redirect to Wikipedia:特色条目候选
> 1a. You try to move Wikipedia:特色条目候选 to Wikipedia:特色条目评选
> 1b. The system gives you a warning saying that Wikipedia:特色条目评选 already exists
> and provides you a checkbox to delete it before moving
> 1c. You tick the checkbox and submit the form again.
> 
> Is this correct?

Yes, that is what I did.
> 
> 6. When you check Wikipedia:特色条目评选, it's a redirect to Wikipedia:特色条目候选 (the
> same as this page before moving? did you check page history and what did it
> say?)
When I checked Wikipedia:特色条目评选, the page did not exist.

> 7. Which title did you do this operation on? Does "restore history revision"
> (only one? what's it?) mean undeleting an archive (=deleted in usual words)
> revision? If yes, are there any revisions in action=history before undeletion?

I restored 3 history revisions.
Comment 20 Peter Bena 2012-06-18 12:22:19 UTC
Priority +1: no one seems to care about this. Added a few people with shell access to CC.
Comment 21 Dereckson 2012-06-18 12:28:24 UTC
Peter Bena, please review the following document:
https://www.mediawiki.org/wiki/Bug_management/Bugzilla_usage

A bug with a 'high' priority means 'should be fixed within the next month'.

You can check the guidelines for highest-priority bugs at the following URL:
https://www.mediawiki.org/wiki/Bug_management/How_to_triage

You'll note there is a notify procedure to use for the highest/critical combination.

Consider also that a highest-priority bug will slow down general development and ops tasks until it is fixed, and so is the equivalent of a "red alert".
Comment 22 Liangent 2012-08-03 13:32:07 UTC
(In reply to comment #2)
> mysql> select * from revision where rev_page=46728 order by rev_id desc limit 10;
> +----------+----------+-------------+-----------------------------------------------------------+----------+---------------+----------------+----------------+-------------+---------+---------------+---------------------------------+
> | rev_id   | rev_page | rev_text_id | rev_comment                                               | rev_user | rev_user_text | rev_timestamp  | rev_minor_edit | rev_deleted | rev_len | rev_parent_id | rev_sha1                        |
> +----------+----------+-------------+-----------------------------------------------------------+----------+---------------+----------------+----------------+-------------+---------+---------------+---------------------------------+
> | 21416960 |    46728 |    21613695 | consistency                                               |   775182 | Cravix        | 20120531141052 |              1 |           0 |    3421 |      21345461 | 0cu0xuhfnyfcgsfwowsww8qg7byyg6m |

It seems the relevant revision / archive rows are missing while the text row is still there:

mysql> select * from archive where ar_rev_id=21416960;
Empty set (0.07 sec)

mysql> select * from archive where ar_timestamp='20120531141052' /* SLOW_OK */;
Empty set (40.81 sec)

mysql> select * from revision where rev_sha1='0cu0xuhfnyfcgsfwowsww8qg7byyg6m' /* SLOW_OK */;
Empty set (16 min 28.74 sec)

mysql> select * from revision where rev_text_id=21613695 /* SLOW_OK */;
Empty set (43 min 54.86 sec)

mysql> select * from text where old_id=21613695;
+----------+---------------+-----------+-----------------------+-------------+----------+---------------+---------------+----------------+---------------------+-------------------+
| old_id   | old_namespace | old_title | old_text              | old_comment | old_user | old_user_text | old_timestamp | old_minor_edit | old_flags           | inverse_timestamp |
+----------+---------------+-----------+-----------------------+-------------+----------+---------------+---------------+----------------+---------------------+-------------------+
| 21613695 |             0 |           | DB://cluster23/398889 |             |        0 |               |               |              0 | utf-8,gzip,external |                   |
+----------+---------------+-----------+-----------------------+-------------+----------+---------------+---------------+----------------+---------------------+-------------------+
1 row in set (0.03 sec)
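
The same check can be phrased as one anti-join: list the `text` rows that no `revision` or `archive` row points to. The sketch below runs against an in-memory SQLite database with a reduced, assumed schema; `ar_text_id` in particular is an assumption, since that column does not appear in the output above. The helper name `orphaned_text_ids` and the two toy rows (ids 1 and 2) are made up for the example. On a real replica this would likely be as slow as the queries above unless the joined columns are indexed.

```python
import sqlite3


def orphaned_text_ids(db):
    """Return ids of text rows that neither revision nor archive refers to."""
    rows = db.execute(
        'SELECT t.old_id FROM "text" AS t '
        "WHERE NOT EXISTS (SELECT 1 FROM revision AS r WHERE r.rev_text_id = t.old_id) "
        "AND NOT EXISTS (SELECT 1 FROM archive AS a WHERE a.ar_text_id = t.old_id) "
        "ORDER BY t.old_id"
    ).fetchall()
    return [old_id for (old_id,) in rows]


db = sqlite3.connect(":memory:")
db.executescript(
    """
    CREATE TABLE "text" (old_id INTEGER PRIMARY KEY, old_text TEXT, old_flags TEXT);
    CREATE TABLE revision (rev_id INTEGER PRIMARY KEY, rev_text_id INTEGER);
    CREATE TABLE archive (ar_rev_id INTEGER, ar_text_id INTEGER);
    -- Toy rows: text 1 has a live revision, text 2 an archived (deleted) one.
    INSERT INTO "text" VALUES (1, 'live', '');
    INSERT INTO "text" VALUES (2, 'deleted', '');
    INSERT INTO revision VALUES (10, 1);
    INSERT INTO archive VALUES (20, 2);
    -- The text row from the output above, with nothing pointing at it.
    INSERT INTO "text" VALUES (21613695, 'DB://cluster23/398889', 'utf-8,gzip,external');
    """
)

print(orphaned_text_ids(db))
```

Only the text row without a matching revision or archive row is reported, which is the situation the queries above show for text 21613695.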
Comment 23 Sam Reed (reedy) 2013-05-16 21:44:42 UTC
Why do we need this bug? There's bug 39008 requesting an import of history, and then bug 39007 for the actual underlying problem
Comment 24 Liangent 2013-05-17 07:23:08 UTC
(In reply to comment #23)
> Why do we need this bug? There's bug 39008 requesting an import of history,
> and then bug 39007 for the actual underlying problem

It might be OK to close bug 39007 and use this one to track the underlying issue. But this one is filed under the Wikimedia product, and the problem itself may belong to MediaWiki (where bug 39007 is filed).
