Add Hebrew to supported languages
Open, Low · Public · Feature

Description

Currently, Hebrew (he) is not in $elasticsearchLanguageAnalyzers.
"Elasticsearch doesn’t have any built in stop words for Hebrew (he) so we define a custom list pulled from an online list"
https://gibrown.wordpress.com/2013/05/01/three-principles-for-multilingal-indexing-in-elasticsearch/

It is probably worth asking them to share their stop word list and/or to commit it upstream. Alternatively, we could build one ourselves from the same source, http://web.archive.org/web/20120821214110/http://wiki.korotkin.co.il/Hebrew_stopwords , which is also mentioned in https://wiki.apache.org/solr/LanguageAnalysis#Hebrew
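
As a rough illustration of what "define a custom list" could look like on the Elasticsearch side, here is a minimal sketch that builds index settings for a custom analyzer with a "stop" token filter. The names hebrew_analysis_settings, hebrew_stop and hebrew_custom are hypothetical, and the three sample words are common Hebrew function words used only as placeholders. They are not taken from either list linked above, and this is not how CirrusSearch is actually configured.

```python
import json


def hebrew_analysis_settings(stopwords):
    """Build Elasticsearch index settings with a custom stop word filter.

    All names used here (hebrew_stop, hebrew_custom) are hypothetical.
    """
    return {
        "analysis": {
            "filter": {
                # "stop" is Elasticsearch's stop token filter; it accepts
                # an explicit list of words via the "stopwords" parameter.
                "hebrew_stop": {"type": "stop", "stopwords": list(stopwords)},
            },
            "analyzer": {
                # A custom analyzer: standard tokenizer, then stop word removal.
                "hebrew_custom": {
                    "type": "custom",
                    "tokenizer": "standard",
                    "filter": ["hebrew_stop"],
                },
            },
        }
    }


# Placeholder words for illustration only, not a vetted stop word list.
SAMPLE_STOPWORDS = ["של", "את", "על"]

settings = hebrew_analysis_settings(SAMPLE_STOPWORDS)
print(json.dumps(settings, ensure_ascii=False, indent=2))
```

The resulting dictionary would go under "settings" in an index creation request. A real list would need review by Hebrew speakers before being committed here or upstream.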


Version: master
Severity: enhancement

Details

Reference
bz54879

Event Timeline

bzimport raised the priority of this task to Low. · Nov 22 2014, 2:33 AM
bzimport added projects: CirrusSearch, I18n.
bzimport set Reference to bz54879.
bzimport added a subscriber: Unknown Object (MLST).
Restricted Application added a subscriber: Aklapper.
Aklapper changed the subtype of this task from "Task" to "Feature Request". · Feb 4 2022, 11:13 AM

I see that https://www.elastic.co/guide/en/elasticsearch/reference/current/analysis-lang-analyzer.html does not include Hebrew in its list. Is that what this task is about? The links from 2013 do not make much sense a decade later.