Multi-language searching
Multi-language searching involves many complex trade-offs. It's probably best to ask questions about your specific use case on the Solr user list.
Here is our configuration. Note that we set autoGeneratePhraseQueries="false".
Tom
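<!-- autoGeneratePhraseQueries="false": a query term that splits into several tokens is not turned into a phrase query -->
<fieldType name="FullText" class="solr.TextField" positionIncrementGap="100" autoGeneratePhraseQueries="false">
  <analyzer type="index">
    <tokenizer class="solr.ICUTokenizerFactory"/>
    <filter class="solr.ICUFoldingFilterFactory"/>
    <!-- Index common words both on their own and as bigrams with their neighbours -->
    <filter class="solr.CommonGramsFilterFactory" words="1000common.txt"/>
  </analyzer>
  <analyzer type="query">
    <tokenizer class="solr.ICUTokenizerFactory"/>
    <filter class="solr.ICUFoldingFilterFactory"/>
    <!-- Query-side counterpart of the index-time CommonGrams filter -->
    <filter class="solr.CommonGramsQueryFilterFactory" words="1000common.txt"/>
  </analyzer>
</fieldType>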
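
A field type only takes effect once a field in schema.xml refers to it by name. A minimal sketch of such a declaration follows. The field name "content" and the indexed/stored/multiValued settings are assumptions made for illustration and are not taken from the configuration above.

```xml
<!-- Hypothetical field: the name "content" and the flags below are illustrative assumptions -->
<field name="content" type="FullText" indexed="true" stored="false" multiValued="false"/>
```

The ICU tokenizer and folding filter are not part of Solr core. They ship with the analysis-extras contrib, so the ICU jars have to be on the classpath. The word list named in words="1000common.txt" also has to be present in the core's conf directory.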