Last modified: 2014-08-08 21:04:20 UTC

Wikimedia Bugzilla is closed!

Wikimedia migrated from Bugzilla to Phabricator. Bug reports are handled in Wikimedia Phabricator.
This static website is read-only and kept for historical purposes. It is not possible to log in, and apart from displaying bug reports and their history, links might be broken. See T71309, the corresponding Phabricator task, for complete and up-to-date bug report information.
Bug 69309 - Prop=extracts does not return links
Status: RESOLVED WONTFIX
Product: MediaWiki extensions
Classification: Unclassified
Component: TextExtracts
Version: unspecified
Hardware: All All
Importance: Unprioritized normal
Target Milestone: ---
Assigned To: Nobody
Reported: 2014-08-08 19:35 UTC by dyoung287
Modified: 2014-08-08 21:04 UTC (History)

Description dyoung287 2014-08-08 19:35:13 UTC
I would like to retrieve Wikipedia summaries only (without having to download an entire Wikipedia page). However, none of the prop=extracts options include the links in the text; only HTML text formatting is included.
Comment 1 Rainer Rillke @commons.wikimedia 2014-08-08 19:44:54 UTC
So, for example, /w/api.php?action=query&prop=extracts&format=json&exsentences=3&exlimit=10&exintro=&titles=Therion returns

{
    "query": {
        "pages": {
            "1590770": {
                "pageid": 1590770,
                "ns": 0,
                "title": "Therion",
                "extract": "<p><b>Therion</b>, Greek for \"wild animal\" or \"beast\" (\u03b8\u03b7\u03c1\u03af\u03bf\u03bd), may refer to:</p>"
            }
        }
    }
}

where the word "Greek" is not a link, although it's a link on https://en.wikipedia.org/wiki/Therion
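
To make the request above easier to reproduce, here is a minimal Python sketch that rebuilds the same query string and checks the extract for anchor tags. It works offline on the response quoted in this comment. The helper names (build_extracts_url, extracts_from_response) are made up for illustration, and the base URL assumes the English Wikipedia endpoint implied by the link above.

```python
import json
from urllib.parse import urlencode

# Assumption: English Wikipedia endpoint, as implied by the URLs in this report.
API_BASE = "https://en.wikipedia.org/w/api.php"


def build_extracts_url(title, sentences=3, base=API_BASE):
    """Rebuild the prop=extracts query quoted in the comment (hypothetical helper)."""
    params = {
        "action": "query",
        "prop": "extracts",
        "format": "json",
        "exsentences": sentences,
        "exlimit": 10,
        "exintro": "",  # flag parameter, sent with an empty value
        "titles": title,
    }
    return base + "?" + urlencode(params)


# The response quoted in the comment, kept verbatim (raw string preserves the escapes).
SAMPLE_RESPONSE = r'''
{
    "query": {
        "pages": {
            "1590770": {
                "pageid": 1590770,
                "ns": 0,
                "title": "Therion",
                "extract": "<p><b>Therion</b>, Greek for \"wild animal\" or \"beast\" (\u03b8\u03b7\u03c1\u03af\u03bf\u03bd), may refer to:</p>"
            }
        }
    }
}
'''


def extracts_from_response(text):
    """Map each page title to its extract HTML."""
    pages = json.loads(text)["query"]["pages"]
    return {page["title"]: page["extract"] for page in pages.values()}


if __name__ == "__main__":
    print(build_extracts_url("Therion"))
    extract = extracts_from_response(SAMPLE_RESPONSE)["Therion"]
    # The extract keeps <p> and <b> but contains no <a> elements.
    print("contains links:", "<a " in extract)
```

On the quoted response, the check finds no anchor tags, which matches the behaviour reported here.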

It would be cool if there were a switch to specify which HTML elements to return, e.g. links/no links.

That would be a useful feature for "word-sense disambiguation". Otherwise, action=parse has to be used.
Comment 2 Max Semenik 2014-08-08 21:02:04 UTC
Well, extracts are supposed to be extracts - not much will be left to extract if we preserve links too, so if you need full HTML, use action=parse.
Comment 3 Rainer Rillke @commons.wikimedia 2014-08-08 21:04:20 UTC
(In reply to Max Semenik from comment #2)
I recommended action=render of index.php -- is there another way to get just the introduction (with links)? It's about saving bandwidth.
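
For readers comparing the two routes mentioned in this thread, the sketch below builds an action=render URL for index.php and an action=parse URL for api.php, then collects link targets from returned HTML. It is only an illustration under stated assumptions: the helper names are invented, restricting action=parse to section 0 is one possible way to request only the lead section and was not suggested in this thread, and SAMPLE_HTML is a made-up fragment (including its link target), not real server output.

```python
from html.parser import HTMLParser
from urllib.parse import urlencode

# Assumption: English Wikipedia endpoints, as implied by the URLs in this report.
INDEX_BASE = "https://en.wikipedia.org/w/index.php"
API_BASE = "https://en.wikipedia.org/w/api.php"


def build_render_url(title, base=INDEX_BASE):
    """URL for the rendered page HTML via index.php (hypothetical helper)."""
    return base + "?" + urlencode({"title": title, "action": "render"})


def build_parse_url(page, base=API_BASE):
    """URL for action=parse, limited to section 0 (an assumption, see lead-in)."""
    params = {
        "action": "parse",
        "page": page,
        "prop": "text",
        "section": 0,
        "format": "json",
    }
    return base + "?" + urlencode(params)


class LinkCollector(HTMLParser):
    """Collect href values from <a> tags."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href")
            if href:
                self.links.append(href)


def links_in(html):
    """Return the link targets found in an HTML fragment, in document order."""
    collector = LinkCollector()
    collector.feed(html)
    return collector.links


# Made-up fragment for illustration only; the link target is hypothetical.
SAMPLE_HTML = '<p><b>Therion</b>, <a href="/wiki/Greek_language">Greek</a> for "wild animal"</p>'

if __name__ == "__main__":
    print(build_render_url("Therion"))
    print(build_parse_url("Therion"))
    print(links_in(SAMPLE_HTML))
```

Whether either route actually saves bandwidth compared with downloading the full page depends on the page and is not something this sketch measures.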
