Solr DIH indexing is fast but search is very slow
I have a database table containing ~30 GB of data. I am indexing it using DIH. Indexing the data takes 1 hr 15 minutes, but search is slow and takes around 1 minute, which doesn't seem right. Please help if anyone has faced the same issue.
I am providing the content of the files.
data-config.xml
<dataConfig>
  <dataSource type="JdbcDataSource"
              driver="com.mysql.jdbc.Driver"
              url="jdbc:mysql://battrdbtest20/test_results"
              batchSize="-1"
              user="results"
              password="resultsloader"/>
  <document>
    <entity name="syndrome"
            pk="test_file_result_id"
            query="select * from syndrome">
      <field column="test_file_result_id" name="test_file_result_id"/>
      <field column="syndrome" name="syndrome"/>
    </entity>
  </document>
</dataConfig>

schema.xml (I changed the fields to suit my data)
<fields>
  <field name="test_file_result_id" type="slong" indexed="true" stored="true" required="true" omitNorms="true" multiValued="false" />
  <field name="syndrome" type="string" indexed="true" stored="true" required="true" omitNorms="false" multiValued="false" />
</fields>
<uniqueKey>test_file_result_id</uniqueKey>
<defaultSearchField>syndrome</defaultSearchField>

There is no change in solrconfig.xml.
test_file_result_id is an id of 10 digits, and the syndrome field stores a blob that contains a huge amount of data (a kind of log file content).
I should mention that when I search on test_file_result_id, the results come back within a second, but a search on syndrome takes more than a minute.
Thanks in advance!
I am assuming that string is defined as solr.StrField in schema.xml.
Since you have a blob of data, it would possibly be useful to use a field type that has the right set of tokenizers, analyzers, and filters.
For example, adding StandardTokenizerFactory splits the content into tokens and keeps them to a meaningful set of values.
An example of a fieldType definition:
<fieldType name="text_general" class="solr.TextField" positionIncrementGap="100" omitNorms="true">
  <analyzer type="index">
    <tokenizer class="solr.StandardTokenizerFactory" />
    <filter class="solr.StopFilterFactory" ignoreCase="true" words="stopwords.txt" enablePositionIncrements="true" />
    <filter class="solr.LowerCaseFilterFactory" />
  </analyzer>
</fieldType>

You can try this, and it should make a difference in response time.
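For the field type to take effect, the syndrome field itself has to reference it. A minimal sketch of how that might look in schema.xml, assuming the text_general definition above has been added to the schema, that stopwords.txt exists in the core's conf directory, and that the remaining attributes stay as in the question:

```xml
<fields>
  <!-- unchanged: the numeric id stays as it was in the question -->
  <field name="test_file_result_id" type="slong" indexed="true" stored="true" required="true" omitNorms="true" multiValued="false" />

  <!-- assumption: only the type changes, from "string" to "text_general",
       so the blob is tokenized instead of being indexed as one huge term -->
  <field name="syndrome" type="text_general" indexed="true" stored="true" required="true" multiValued="false" />
</fields>
```

A change of field type only applies to documents indexed after the change, so a full re-import through DIH would be needed before comparing search times. With a string field each blob is indexed as a single term, which is probably why exact lookups on the id are fast while searches inside syndrome are not.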