Last modified: 2014-10-17 03:28:37 UTC

Wikimedia Bugzilla is closed!

Wikimedia migrated from Bugzilla to Phabricator. Bug reports are handled in Wikimedia Phabricator.
This static website is read-only and for historical purposes. It is not possible to log in and except for displaying bug reports and their history, links might be broken. See T73996, the corresponding Phabricator task for complete and up-to-date bug report information.
Bug 71996 - Find a way to make global item ids
Status: NEW
Product: MediaWiki extensions
Classification: Unclassified
Component: WikidataRepo
Version: unspecified
Hardware: All All
Importance: Normal normal
Assigned To: Wikidata bugs
Reported: 2014-10-13 14:37 UTC by Bene*
Modified: 2014-10-17 03:28 UTC

Description Bene* 2014-10-13 14:37:10 UTC
When we have different WikibaseRepo installations on several wikis, each of them has its own items and item ids. These ids should be globally unique and identify the wiki they belong to, because they might also be used on any of the other wikis.

Example: commons:File:xxx.jpg uses wikidata:Q123 as author

There are several options for how this can be implemented:

using different prefixes (Q => Wikidata, C => Commons, Z => Wikiquote)
+ item ids do not get more complicated
- you need the mapping of prefixes to wikis in every place you handle item ids
- the mapping only works for Wikimedia projects, but third-party users might use the same prefixes for different things

adding the domain/interwiki prefix to the item ids (wikidata:Q123 or d:Q123)
+ creates globally unique identifiers within the Wikimedia projects
- hard to read and not "handy", also takes more space/characters

using full urls for item ids (www.wikidata.org/entity/Q123)
+ creates globally unique identifiers throughout the web
- hard to read and not "handy", takes even more space/characters
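
To make the three options concrete, here is a minimal Python sketch that renders the same item in each form. The `PREFIX_TO_WIKI` and `WIKI_TO_BASE_URL` tables, the function names and the Commons base URL are assumptions made for illustration only; none of them is part of Wikibase.

```python
# Option 1: hypothetical prefix table. Every consumer of item ids needs a copy of it.
PREFIX_TO_WIKI = {"Q": "wikidata", "C": "commons", "Z": "wikiquote"}

# Option 3: hypothetical base URLs, written without a scheme as in the example above.
WIKI_TO_BASE_URL = {
    "wikidata": "www.wikidata.org/entity/",
    "commons": "commons.wikimedia.org/entity/",
}


def wiki_from_prefix(item_id):
    """Option 1: derive the wiki from the first letter of the id."""
    return PREFIX_TO_WIKI[item_id[0]]


def interwiki_id(wiki, local_id):
    """Option 2: put the interwiki prefix in front of the local id."""
    return f"{wiki}:{local_id}"


def url_id(wiki, local_id):
    """Option 3: append the local id to the base URL of the wiki."""
    return WIKI_TO_BASE_URL[wiki] + local_id
```

With these helpers, `interwiki_id("wikidata", "Q123")` gives `wikidata:Q123` and `url_id("wikidata", "Q123")` gives `www.wikidata.org/entity/Q123`. Option 1 is the only one that cannot work without the lookup table.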

Comment 1 Jan Zerebecki 2014-10-13 15:13:33 UTC
We probably want a combination of a unique ID prefix within Wikimedia projects and URLs that can be used by the rest of the (semantic) web.
Comment 2 Jeroen De Dauw 2014-10-17 03:07:33 UTC
I know I have discussed this with Markus before, since it was relevant for WDTK. Can't find where this was though. I've poked Markus now, since I want his input.
Comment 3 Jeroen De Dauw 2014-10-17 03:28:37 UTC
> using different prefixes (Q => Wikidata, C => Commons, Z => Wikiquote)

Nope, not going back to configurable prefixes :) Guess you were not involved yet at the point where they caused too much pain. It has some further drawbacks that you did not list:

- Not globally unique (can have more Wikibase Repo instances than there are letters)
- Item IDs are no longer easily recognizable as such (similar to: if it's 35 characters and starts with a "1", then I can guess it's a BTC address)
- The prefixes will be rather obscure. If I see "Z123", how would I derive "Wikiquote"?

We do of course not need truly global IDs everywhere. And we have plenty of locations where changing the current IDs would be a huge pain, as well as needing BC for certain things forever. I hope that is enough to establish that changing the existing IDs we have and forcing the change to happen everywhere is a bad idea. Rather than making such a modification, we need to add new support for global IDs, and switch to using that where appropriate. That still leaves the question of what the global IDs should look like.
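
One way to picture "add new support for global IDs" without touching the existing ones is a wrapper that carries the current ID unchanged. This is only a sketch: the `GlobalItemId` class, its field names and the `wiki.example.org` host are hypothetical, and the format of the repository part is exactly the open question mentioned above.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class GlobalItemId:
    """Hypothetical wrapper: a repository identifier plus the unchanged local id."""
    repository: str  # e.g. "www.wikidata.org"; the actual format is undecided
    local_id: str    # the existing id, e.g. "Q123", left exactly as it is today


def local_ids_only(ids):
    """Code that only understands local ids keeps working on the unwrapped value."""
    return [gid.local_id for gid in ids]


# Two installations may both have an item Q123; the wrapper keeps them apart.
a = GlobalItemId("www.wikidata.org", "Q123")
b = GlobalItemId("wiki.example.org", "Q123")
```

Here `a` and `b` compare as different global IDs, while places that need BC can keep reading `local_id` as before.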

> using full urls for item ids (www.wikidata.org/entity/Q123)

If you need IDs global to something more than your own Wikibase Repository installation, then I think it makes little sense to just go part of the way and not make it globally unique altogether. So I much prefer this approach over wikidata:Q123. That is also what WDTK already does if I'm not mistaken. Using the half-way approach in public interfaces seems especially bad. I can imagine that it might be slightly better in some specific internal cases, which however does not make it generally suitable.
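
As a rough illustration of the URL form, the sketch below splits such an ID back into its repository part and the local ID. The function name `parse_url_id` is hypothetical, and it assumes the local ID is always the last path segment, as in the www.wikidata.org/entity/Q123 example; it does not describe how WDTK does it.

```python
def parse_url_id(url_id):
    """Split a URL-style item id into (repository part, local id).

    Assumption: the local id is the last path segment of the URL.
    """
    base, sep, local_id = url_id.rpartition("/")
    if not sep or not base or not local_id:
        raise ValueError(f"not a URL-style item id: {url_id!r}")
    return base, local_id
```

A plain local ID such as `Q123` is rejected with a `ValueError`, so the two kinds of ID cannot be confused silently.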
