Last modified: 2013-11-04 17:26:43 UTC

Wikimedia Bugzilla is closed!

Wikimedia migrated from Bugzilla to Phabricator. Bug reports are handled in Wikimedia Phabricator.
This static website is read-only and for historical purposes. It is not possible to log in and except for displaying bug reports and their history, links might be broken. See T48641, the corresponding Phabricator task for complete and up-to-date bug report information.
Bug 46641 - Storage for analysis results, term props
Storage for analysis results, term props
Status: RESOLVED INVALID
Product: MediaWiki extensions
Classification: Unclassified
WikidataRepo (Other open bugs)
unspecified
All All
: Unprioritized normal (vote)
: ---
Assigned To: Wikidata bugs
:
Depends on:
Blocks:
  Show dependency treegraph
 
Reported: 2013-03-28 17:00 UTC by jeblad
Modified: 2013-11-04 17:26 UTC (History)
3 users (show)

See Also:
Web browser: ---
Mobile Platform: ---
Assignee Huggle Beta Tester: ---


Attachments

Description jeblad 2013-03-28 17:00:49 UTC
In several places we need some kind of storage place for results gathered from heavy analysis. Two (three) of three (four) known places have somewhat working solutions. Those are Items and Properties, and to some extent Queries. For those we can use page props. The one with no solution is terms used in Claims, but it seems like we can make some kind of "term props" that do the same as page props.

It is one thing that is somewhat annoying about this way of doing this. Page props is said to be easy (lightweight) to regenerate but the analysis results are not in general very lightweight, they are actually quite heavy to generate in some cases.

The analysis results for terms are in general terms from one property that are similar to terms from one or more other properties. The results should then be stored in something like a "term props" table. The results are typically employed as ranking during sorting of terms to be used in claims.

The calculation can be done by using a multivariate Bayes and sorting on the resulting a posteriori probability. This is a two-level map-reduce problem where the first part is the Bayes and the second is some kind of dimensionality reduction.

The analysis results for properties are in general properties from one item that are similar to properties from one or more other items. The results in this case can be saved as ordinary page props. Usually the results from the analysis of properties are employed as ranking of properties during creation of claims, if a property is commonly used together with some other property it will be ranked up in the list. Otherwise it is similar to the terms analysis.
Comment 1 Lydia Pintscher 2013-11-04 17:26:43 UTC
Sorry but I have no idea what this is supposed to be about or achieve.

Note You need to log in before you can comment on or make changes to this bug.


Navigation
Links