Last modified: 2014-07-25 22:49:26 UTC

Wikimedia Bugzilla is closed!

Wikimedia migrated from Bugzilla to Phabricator. Bug reports are handled in Wikimedia Phabricator.
This static website is read-only and for historical purposes. It is not possible to log in and except for displaying bug reports and their history, links might be broken. See T70577, the corresponding Phabricator task for complete and up-to-date bug report information.
Bug 68577 - GWToolset should report that it skipped a file because it was a duplicate
GWToolset should report that it skipped a file because it was a duplicate
Status: NEW
Product: MediaWiki extensions
Classification: Unclassified
GWToolset (Other open bugs)
unspecified
All All
: Normal enhancement (vote)
: ---
Assigned To: Nobody - You can work on this!
:
Depends on:
Blocks:
  Show dependency treegraph
 
Reported: 2014-07-25 15:47 UTC by Jean-Fred
Modified: 2014-07-25 22:49 UTC (History)
5 users (show)

See Also:
Web browser: ---
Mobile Platform: ---
Assignee Huggle Beta Tester: ---


Attachments

Description Jean-Fred 2014-07-25 15:47:14 UTC
According to bug 68285, “if there are duplicates, GWToolset would unfortunately fail silently at the moment.”

What would fail exactly?
1/ the all upload
2/ the thread
3/ just the file

I don’t think 1 & 2 are acceptable: duplicated files are to be expected (Wikimedians routinely scour online digital collections of GLAMs to import the very specific files needed for their purpose). At worst the duplicated file should be skipped without interfering with the rest of the upload.
Comment 1 dan 2014-07-25 17:45:09 UTC
3/ just the file

the "failure", is really a skip over the upload of that specific file. the output to the runJobs log indicates the title of the file that already exists on commons, which is http://commons.wikimedia.org/wiki/File:Regard_du_Chaudron_01.jpg. for the following example:

2014-07-25 17:39:11 gwtoolsetUploadMediafileJob User:Dan-nl/GWToolset/Mediafile_Batch_Job/53d2963e2d3d1 options=array(3) whitelisted-post=array(48) user-name=Dan-nl user-options=array(25) t=1586 error=GWToolset\Jobs\UploadMediafileJob::run: <An identical media file already exists under the title "File:Regard du Chaudron 01.jpg".>
original URL: http://www.culture.gouv.fr/Wave/image/memoire/0342/sap01_mh00037929_p.jpg
evaluated URL: http://www.culture.gouv.fr/Wave/image/memoire/0342/sap01_mh00037929_p.jpg
Array
(
    [categories] => GWToolset Batch Upload
    [gwtoolset-category-phrase] => Array
        (
            [0] => 
        )

    [gwtoolset-category-metadata] => Array
        (
            [0] => category
        )

    [gwtoolset-detect-license] => 
    [comment] => 
    [gwtoolset-global-license] => 
    [gwtoolset-mediafile-throttle] => 10
    [gwtoolset-mediawiki-template-name] => MH_IDF-Ingestion
    [gwtoolset-metadata-file-url] => 
    [gwtoolset-metadata-file-relative-path] => 2/e/1/2e138b3644aafb47f439e0b20d792420.xml
    [gwtoolset-metadata-file-sha1] => 1zzuc76qckcndlmhzsatcvz0nuf53dx
    [gwtoolset-partner-template-url] => 
    [preview] => 
    [gwtoolset-record-begin] => 1
    [gwtoolset-record-count] => 6
    [gwtoolset-record-current] => 1
    [gwtoolset-record-element-name] => record
    [save-as-batch-job] => 1
    [gwtoolset-title] => Array
        (
            [0] => commons_title
        )

    [gwtoolset-reupload-media] => 
    [gwtoolset-reverse-creator] => 
    [gwtoolset-wrap-creator] => 
    [gwtoolset-wrap-institution] => 
    [gwtoolset-wrap-language] => 
    [gwtoolset-url-to-the-media-file] => Array
        (
            [0] => _url
        )

)



the batch job then continues to run through any additional files that may be part of the batch, however, additional duplicates will "fail" in a similar manner.

ideally, in the future, GWToolset will use a process to gather these runJob messages and report them to the user who initiated the upload.
Comment 2 Jean-Fred 2014-07-25 22:49:26 UTC
That’s good to hear, thanks. I’ll repurpose this bug for the warning then.

(then the duplicated file problem does not explain why my entire batch upload failed as described in bug 68285 ;-)

Note You need to log in before you can comment on or make changes to this bug.


Navigation
Links