Last modified: 2014-07-25 22:49:26 UTC
According to bug 68285, “if there are duplicates, GWToolset would unfortunately fail silently at the moment.” What would fail exactly?
1/ the whole upload
2/ the thread
3/ just the file
I don’t think 1 & 2 are acceptable: duplicated files are to be expected (Wikimedians routinely scour the online digital collections of GLAMs to import the very specific files needed for their purpose). At worst, the duplicated file should be skipped without interfering with the rest of the upload.
3/ just the file

The "failure" is really a skip over the upload of that specific file. The output to the runJobs log indicates the title of the file that already exists on Commons, which in the following example is http://commons.wikimedia.org/wiki/File:Regard_du_Chaudron_01.jpg:

2014-07-25 17:39:11 gwtoolsetUploadMediafileJob User:Dan-nl/GWToolset/Mediafile_Batch_Job/53d2963e2d3d1 options=array(3) whitelisted-post=array(48) user-name=Dan-nl user-options=array(25) t=1586 error=GWToolset\Jobs\UploadMediafileJob::run: <An identical media file already exists under the title &quot;File:Regard du Chaudron 01.jpg&quot;.>
original URL: http://www.culture.gouv.fr/Wave/image/memoire/0342/sap01_mh00037929_p.jpg
evaluated URL: http://www.culture.gouv.fr/Wave/image/memoire/0342/sap01_mh00037929_p.jpg
Array
(
    [categories] => GWToolset Batch Upload
    [gwtoolset-category-phrase] => Array ( [0] => )
    [gwtoolset-category-metadata] => Array ( [0] => category )
    [gwtoolset-detect-license] =>
    [comment] =>
    [gwtoolset-global-license] =>
    [gwtoolset-mediafile-throttle] => 10
    [gwtoolset-mediawiki-template-name] => MH_IDF-Ingestion
    [gwtoolset-metadata-file-url] =>
    [gwtoolset-metadata-file-relative-path] => 2/e/1/2e138b3644aafb47f439e0b20d792420.xml
    [gwtoolset-metadata-file-sha1] => 1zzuc76qckcndlmhzsatcvz0nuf53dx
    [gwtoolset-partner-template-url] =>
    [preview] =>
    [gwtoolset-record-begin] => 1
    [gwtoolset-record-count] => 6
    [gwtoolset-record-current] => 1
    [gwtoolset-record-element-name] => record
    [save-as-batch-job] => 1
    [gwtoolset-title] => Array ( [0] => commons_title )
    [gwtoolset-reupload-media] =>
    [gwtoolset-reverse-creator] =>
    [gwtoolset-wrap-creator] =>
    [gwtoolset-wrap-institution] =>
    [gwtoolset-wrap-language] =>
    [gwtoolset-url-to-the-media-file] => Array ( [0] => _url )
)

The batch job then continues to run through any additional files that may be part of the batch; however, additional duplicates will "fail" in a similar manner.
Ideally, in the future, GWToolset will use a process to gather these runJobs messages and report them to the user who initiated the upload.
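Until something like that exists, the duplicate notices could be pulled out of the runJobs log by hand. The following is only a rough sketch in Python, not part of GWToolset: the function name `find_duplicates` and both regular expressions are hypothetical, and they are modelled solely on the single log entry quoted above, so other message formats may not match.

```python
import re

# Hypothetical patterns, derived only from the one log entry quoted in this
# report. The title may be wrapped in literal quotes or in &quot; entities.
DUPLICATE_RE = re.compile(
    r'An identical media file already exists under the title '
    r'(?:&quot;|")(?P<title>.+?)(?:&quot;|")'
)
URL_RE = re.compile(r'original URL: (?P<url>\S+)')


def find_duplicates(log_text):
    """Return (existing_title, original_url) pairs for skipped duplicates.

    The URL is taken from the first "original URL:" that follows the
    duplicate message; it is None if no such field follows.
    """
    results = []
    for match in DUPLICATE_RE.finditer(log_text):
        url_match = URL_RE.search(log_text, match.end())
        url = url_match.group('url') if url_match else None
        results.append((match.group('title'), url))
    return results


# Abbreviated copy of the log entry quoted above.
sample = (
    'error=GWToolset\\Jobs\\UploadMediafileJob::run: <An identical media '
    'file already exists under the title '
    '&quot;File:Regard du Chaudron 01.jpg&quot;.> '
    'original URL: http://www.culture.gouv.fr/Wave/image/memoire/0342/'
    'sap01_mh00037929_p.jpg '
    'evaluated URL: http://www.culture.gouv.fr/Wave/image/memoire/0342/'
    'sap01_mh00037929_p.jpg'
)

for title, url in find_duplicates(sample):
    print(f'skipped duplicate: {url} already exists as {title}')
```

Run against a full runJobs log, this would give the uploader a list of which source URLs were skipped and which existing Commons files they duplicate.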
That’s good to hear, thanks. I’ll repurpose this bug for the warning then. (But then the duplicated-file problem does not explain why my entire batch upload failed, as described in bug 68285 ;-)