Last modified: 2014-05-14 21:21:17 UTC

Wikimedia Bugzilla is closed!

Wikimedia migrated from Bugzilla to Phabricator. Bug reports are handled in Wikimedia Phabricator.
This static website is read-only and for historical purposes. It is not possible to log in and except for displaying bug reports and their history, links might be broken. See T66350, the corresponding Phabricator task for complete and up-to-date bug report information.
Bug 64350 - Search for nonexistent page with quote mark in title doesn't show "create page link" on he.ws
Search for nonexistent page with quote mark in title doesn't show "create pag...
Status: RESOLVED FIXED
Product: MediaWiki extensions
Classification: Unclassified
CirrusSearch (Other open bugs)
master
All All
: Normal normal with 1 vote (vote)
: ---
Assigned To: Nobody - You can work on this!
https://he.wikisource.org/w/index.php...
:
Depends on:
Blocks:
  Show dependency treegraph
 
Reported: 2014-04-24 07:33 UTC by Inkbug
Modified: 2014-05-14 21:21 UTC (History)
5 users (show)

See Also:
Web browser: ---
Mobile Platform: ---
Assignee Huggle Beta Tester: ---


Attachments

Description Inkbug 2014-04-24 07:33:39 UTC
1. Go to Hebrew Wikisource
2. Search for אילת השחר (מלבי"ם)/פרק נ
3. One is told:
   > לא נמצאו תוצאות המתאימות לחיפוש.
   Instead of:
   >  לא נמצאו תוצאות המתאימות לחיפוש.
   >אין בוויקיטקסט העברי דף בשם "אילת השחר (מלבי"ם)/פרק נ". ליצירת דף חדש בשם זה, לחצו על הקישור האדום (דפים מקושרים).
Comment 1 Andre Klapper 2014-04-24 08:33:08 UTC
Confirming, also happens for above link with &srbackend=CirrusSearch
Comment 2 Nahum Wengrov 2014-04-24 09:48:14 UTC
Bug only exists for double quotes ("), no bug for artcles with a single quote (')
Comment 3 Nik Everett 2014-04-24 21:17:31 UTC
Switching to cirrus and raising priority because it belies a worse problem: In full text search Cirrus always interprets quotation marks as search syntax.  It looks like in Hebrew they could be a [[Gershayim]].  Cirrus will see them as an unbalance quote and helpfully balance it for the user and then interpret that as a phrase query.  So full text search with them will be broken.

So where do we go from here?  Is "quoting some words" normal in Hebrew to represent a phrase?  If so, we should keep that syntax.  If not then we can switch to whatever is normal for phrases and not use quotes at all for phrases in Hebrew.
Comment 4 Inkbug 2014-04-25 05:39:51 UTC
Quotation marks (מרכאות, “”) are used in Hebrew for phrases and quotes. Gershayim (גרשיים, ״) is used in Hebrew for acronyms.

On a regular Hebrew keyboard, none of these characters are found, and instead the typewriter double quote (") is used for both (this is also the practice on the Hebrew Wikisource).

However, it might be possible to differentiate between phrases and acronyms as follows:
1. If the " character has only one character after it (מלבי"ם), then it should be considered to be a Gershayim, and the word should be dealt with as an acronym.
2. Otherwise, it is a quotation mark, and it should be dealt with as a phrase.

This is not foolproof, but it should deal with most of the problems.
Comment 5 Nik Everett 2014-05-01 01:22:40 UTC
https://gerrit.wikimedia.org/r/#/c/130799/

Note You need to log in before you can comment on or make changes to this bug.


Navigation
Links