Last modified: 2014-02-13 22:10:51 UTC

Wikimedia Bugzilla is closed!

Wikimedia migrated from Bugzilla to Phabricator. Bug reports are handled in Wikimedia Phabricator.
This static website is read-only and for historical purposes. It is not possible to log in and except for displaying bug reports and their history, links might be broken. See T56669, the corresponding Phabricator task for complete and up-to-date bug report information.
Bug 54669 - String "PO:BAR:AZU" is not searched as a whole
Status: RESOLVED WONTFIX
Product: MediaWiki extensions
Classification: Unclassified
Component: MWSearch
Version: unspecified
Hardware: All All
Importance: Low normal
Target Milestone: ---
Assigned To: Nobody - You can work on this!
cirrus-fixed
Depends on:
Blocks:
Reported: 2013-09-26 21:43 UTC by Massimo Palmieri
Modified: 2014-02-13 22:10 UTC
CC List: 4 users

See Also:
Web browser: ---
Mobile Platform: ---
Assignee Huggle Beta Tester: ---


Description Massimo Palmieri 2013-09-26 21:43:42 UTC
When I search for the string PO:BAR:AZU, I want to retrieve only the string "PO:BAR:AZU", not the string "PO:BAR" close to the string "AZU". Thank you.
Comment 1 Nik Everett 2013-09-27 19:35:44 UTC
Do you have a concrete example, maybe a link to a search that isn't returning what you need?

The general problem with this request is that search (mostly) breaks text into terms using the rules in http://www.unicode.org/reports/tr29/#Word_Boundaries, and it does so when articles are changed. Changing that requires rebuilding the index of all articles, _and_ most users actually want to split words on colons.
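
To illustrate why the tokenizer matters, here is a minimal Python sketch. It is not the search backend's code: the two tokenizers (`split_on_colons`, `keep_colons`), the `build_index` helper, and the sample pages are all hypothetical and exist only for illustration. It builds a toy inverted index twice, once treating a colon as a word break and once keeping a colon between letters inside the term.

```python
import re
from collections import defaultdict


def split_on_colons(text):
    """Hypothetical tokenizer: a colon always ends a term."""
    return re.findall(r"[a-z0-9]+", text.lower())


def keep_colons(text):
    """Hypothetical tokenizer: a colon between letters/digits stays in the term."""
    return re.findall(r"[a-z0-9]+(?::[a-z0-9]+)*", text.lower())


def build_index(pages, tokenize):
    """Map each term to the set of page titles that contain it.

    The terms are fixed here, at index time, by whichever tokenizer is used.
    """
    index = defaultdict(set)
    for title, text in pages.items():
        for term in tokenize(text):
            index[term].add(title)
    return index


# Made-up sample pages.
pages = {
    "A": "Blazon code PO:BAR:AZU appears here",
    "B": "Here PO:BAR is followed later by AZU",
}

split_index = build_index(pages, split_on_colons)
whole_index = build_index(pages, keep_colons)

# With colon splitting, both pages are indexed under "azu".
print(sorted(split_index["azu"]))
# With colons kept, only page A has the term "po:bar:azu".
print(sorted(whole_index["po:bar:azu"]))
```

Because the index stores whatever terms the tokenizer produced at index time, switching from one tokenizer to the other means running `build_index` again over every page.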

An option that you have now is to search for "PO:BAR:AZU"~0 which will search for PO, BAR, and AZU which are _right_ next to each other.
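
As a rough illustration of what the "PO:BAR:AZU"~0 query asks for, the Python sketch below checks whether the terms PO, BAR, and AZU occur at consecutive positions in a text. It assumes a simple tokenizer that splits on anything that is not a letter or digit. `tokenize` and `phrase_match` are hypothetical helpers, not the search engine's own implementation.

```python
import re


def tokenize(text):
    """Assumed tokenizer: lowercase, split on any non-alphanumeric character."""
    return re.findall(r"[a-z0-9]+", text.lower())


def phrase_match(text, phrase):
    """Return True if the phrase terms occur adjacent and in order (slop 0)."""
    terms = tokenize(text)
    wanted = tokenize(phrase)
    n = len(wanted)
    return n > 0 and any(
        terms[i:i + n] == wanted for i in range(len(terms) - n + 1)
    )


# The three terms are right next to each other: match.
print(phrase_match("arms coded PO:BAR:AZU in the table", "PO:BAR:AZU"))
# PO and BAR are adjacent, but AZU is further away: no match.
print(phrase_match("PO:BAR appears, and later AZU", "PO:BAR:AZU"))
```

Under this assumed tokenizer the colons themselves are discarded, so a text containing "PO BAR AZU" separated by spaces would match as well.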
Comment 2 Nik Everett 2013-09-30 12:36:13 UTC
Despite my dithering I'll prioritize this to low for now and see what I can do about it when I get to it.
Comment 3 Nik Everett 2013-09-30 18:17:20 UTC
So I had a look at this, and the default text segmentation algorithm seems to do exactly what you want. Have a look here:

https://test2.wikipedia.org/w/index.php?search=PO%3ABAR%3AAZU&title=Special%3ASearch&fulltext=1

Can you give another example?  Closing as WORKSFORME for now.
Comment 4 Massimo Palmieri 2013-09-30 19:06:31 UTC
If I search (in it:wiki) for the string PO:BAR:AZU I retrieve 34 entries (see https://it.wikipedia.org/w/index.php?search=PO%3ABAR%3AAZU&title=Speciale%3ARicerca), but only 7 of these entries contain the string PO:BAR:AZU. The remaining entries contain fragments of this string, but not the entire contiguous string.
I can't use the quoted search string "PO:BAR:AZU", because the search tool does not find it, and I can't use "PO:BAR:AZU"~0 either, because the search tool gives the same result (no entries).
A search for the string PO:BAR:AZU~0 retrieves 3 entries, but they are not useful at all, and I don't know why.
Excuse my request, but these strings are used to identify heraldic forms in descriptions, much like mathematical formulas, and I need to use the entire string in searches.
Thanks
Massimo Palmieri
Comment 5 Nik Everett 2013-09-30 19:18:55 UTC
it:wiki isn't using CirrusSearch! it:wiktionary does, but not it:wiki. For what it is worth, I checked the Italian analyzer that we use for it:wiktionary and it left PO:BAR:AZU as a single token - which is what you want.

I'm sorry I can't be of more help with the other search tool.
Comment 6 Andre Klapper 2013-10-01 09:40:24 UTC
This bug report is INVALID for CirrusSearch as CirrusSearch is not involved in the problem. 
Changing component from CirrusSearch to MWSearch (though MWSearch will not receive much attention anymore).
Comment 7 Chad H. 2014-02-13 22:10:51 UTC
Marking WONTFIX as MWSearch/lsearch has reached end of life and is replaced by Cirrus/Elasticsearch.
