Last modified: 2014-03-25 17:51:58 UTC

Wikimedia Bugzilla is closed!

Wikimedia migrated from Bugzilla to Phabricator. Bug reports are handled in Wikimedia Phabricator.
This static website is read-only and for historical purposes. It is not possible to log in and except for displaying bug reports and their history, links might be broken. See T63133, the corresponding Phabricator task for complete and up-to-date bug report information.
Bug 61133 - Let Internet Archive's Wayback machine archive tools
Let Internet Archive's Wayback machine archive tools
Status: RESOLVED WONTFIX
Product: Wikimedia Labs
Classification: Unclassified
tools (Other open bugs)
unspecified
All All
: Normal enhancement
: ---
Assigned To: Marc A. Pelletier
http://tools.wmflabs.org/robots.txt
:
Depends on:
Blocks:
  Show dependency treegraph
 
Reported: 2014-02-10 11:09 UTC by Nemo
Modified: 2014-03-25 17:51 UTC (History)
2 users (show)

See Also:
Web browser: ---
Mobile Platform: ---
Assignee Huggle Beta Tester: ---


Attachments

Description Nemo 2014-02-10 11:09:50 UTC
See bug 56893 for instructions.
Comment 1 Tim Landscheidt 2014-02-10 11:24:08 UTC
What does "archive tools" mean?  Tools (for the most part) process input and generate output from that.  A spider doesn't provide input and thus doesn't get output.

For "archiving tools" (i. e. the interesting bit, the processor), their source code needs to be put in a repository.  But neither Internet Archive nor any other spider can access private source code from the web.
Comment 2 Nemo 2014-02-10 11:31:29 UTC
(In reply to comment #1)
> What does "archive tools" mean?  Tools (for the most part) process input and
> generate output from that.  A spider doesn't provide input and thus doesn't
> get
> output.

Which is why this operation is inexpensive but will allow Wayback to archive URLs referenced from the web or by users.
Comment 3 Marc A. Pelletier 2014-03-25 17:50:11 UTC
Pages with dynamically generated content make no semantic sense to archive, and the cost in resources of allowing spidering of tool URLs is prohibitive.

Tool Labs is not intended for long-lived mostly static content (which is what archiving makes sense for); that data belongs on a wiki -- possibly generated and put there /by/ tools.
Comment 4 Nemo 2014-03-25 17:51:58 UTC
RObbish

Note You need to log in before you can comment on or make changes to this bug.


Navigation
Links