Last modified: 2014-05-01 18:40:46 UTC

Wikimedia Bugzilla is closed!

Wikimedia migrated from Bugzilla to Phabricator. Bug reports are handled in Wikimedia Phabricator.
This static website is read-only and kept for historical purposes. It is not possible to log in, and apart from bug reports and their history, links might be broken. See T65729, the corresponding Phabricator task, for complete and up-to-date bug report information.
Bug 63729 - CirrusSearch: Remove as much non-sentence stuff as possible from article text
Status: RESOLVED FIXED
Product: MediaWiki extensions
Classification: Unclassified
Component: CirrusSearch
Version: unspecified
Hardware: All All
Importance: Unprioritized normal
Target Milestone: ---
Assigned To: Nobody - You can work on this!
Depends on:
Blocks:
Reported: 2014-04-09 15:07 UTC by Nik Everett
Modified: 2014-05-01 18:40 UTC
CC List: 4 users

See Also:
Web browser: ---
Mobile Platform: ---
Assignee Huggle Beta Tester: ---


Description Nik Everett 2014-04-09 15:07:10 UTC
The snippets we generate actually contain stuff from within tables, image captions, and headings.  These don't look great.  If we could smash those into another field then the snippets would be nicer.  We could also use the sentence fragmenter in the experimental highlighter.

Note: we already do this for the headings.  We should do it for tables and infoboxes and stuff.  Maybe we should do it for a CSS class as well.
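
A rough sketch of the split described above, in Python rather than the extension's own code. It routes text inside headings, tables, and a couple of CSS classes into a separate bucket and leaves everything else as the main text. The tag set, the class names (`infobox`, `thumbcaption`), and the helper names are assumptions chosen for illustration, and the depth counter assumes well-formed markup.

```python
from html.parser import HTMLParser

# Hypothetical choices: which elements and CSS classes count as "non-sentence" text.
AUXILIARY_TAGS = {"table", "figcaption", "h1", "h2", "h3", "h4", "h5", "h6"}
AUXILIARY_CLASSES = {"infobox", "thumbcaption"}
# Void elements never get an end tag, so they must not change the depth counter.
VOID_TAGS = {"br", "img", "hr", "input", "meta", "link", "wbr", "col", "area"}


class TextSplitter(HTMLParser):
    """Collects text into a main bucket and an auxiliary bucket."""

    def __init__(self):
        super().__init__()
        self.main = []
        self.auxiliary = []
        self._aux_depth = 0  # number of open elements inside an auxiliary region

    def handle_starttag(self, tag, attrs):
        if tag in VOID_TAGS:
            return
        classes = set((dict(attrs).get("class") or "").split())
        is_aux = tag in AUXILIARY_TAGS or bool(classes & AUXILIARY_CLASSES)
        # Once inside an auxiliary region, every nested element deepens it.
        if self._aux_depth > 0 or is_aux:
            self._aux_depth += 1

    def handle_endtag(self, tag):
        if tag in VOID_TAGS:
            return
        if self._aux_depth > 0:
            self._aux_depth -= 1

    def handle_data(self, data):
        text = data.strip()
        if text:
            bucket = self.auxiliary if self._aux_depth > 0 else self.main
            bucket.append(text)


def split_text(html):
    """Return (main_text, auxiliary_text) for an HTML fragment."""
    parser = TextSplitter()
    parser.feed(html)
    parser.close()
    return " ".join(parser.main), " ".join(parser.auxiliary)
```

Snippets would then be built from the first value only, while the second could be indexed in its own field.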
Comment 1 Quiddity 2014-04-13 22:38:50 UTC
Possibly a duplicate of bug 61669 ?
Comment 2 Dan Garry 2014-04-14 18:00:22 UTC
(In reply to Quiddity from comment #1)
> Possibly a duplicate of bug 61669 ?

Not a duplicate, but certainly related.
Comment 3 Nik Everett 2014-05-01 18:40:46 UTC
Rather than removing the text, we've moved it into another field:
https://gerrit.wikimedia.org/r/#/c/127140/

We'll still search that text, but matches in it will be worth less and are less likely to be highlighted.
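
For illustration only, the weighting idea could look roughly like this as an Elasticsearch-style query body built in Python. The field names `text` and `auxiliary_text`, the helper `build_query`, and the 0.5 boost are assumptions made for this sketch, not values taken from the linked change.

```python
def build_query(terms, aux_boost=0.5):
    """Build a query body that searches both fields but down-weights the auxiliary one.

    Field names and the default boost are hypothetical.
    """
    return {
        "query": {
            "multi_match": {
                "query": terms,
                # "field^boost" scales the score contribution of that field.
                "fields": ["text", f"auxiliary_text^{aux_boost}"],
            }
        }
    }
```

A boost below 1 keeps table and caption matches findable while letting matches in the main prose rank higher.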
