Solr DIH indexing is fast but search is very slow -
i have database table containing ~30 gb of data. indexing dih. indexing data takes 1 hr 15 minutes search slow takes around 1 minute doesn't seem right. please help, if has faced same issue.
i proving content of files.
data-config.xml
<dataconfig> <datasource type="jdbcdatasource" driver="com.mysql.jdbc.driver" url="jdbc:mysql://battrdbtest20/test_results" batchsize="-1" user="results" password="resultsloader"/> <document> <entity name="syndrome" pk="test_file_result_id" query="select * syndrome"> <field column="test_file_result_id" name="test_file_result_id"/> <field column="syndrome" name="syndrome"/> </entity> </document> </dataconfig>
schema.xml (changed fields suit data)
<fields> <field name="test_file_result_id" type="slong" indexed="true" stored="true" required="true" omitnorms="true" multivalued="false" /> <field name="syndrome" type="string" indexed="true" stored="true" required="true" omitnorms="false" multivalued="false" /> </fields> <uniquekey>test_file_result_id</uniquekey> <defaultsearchfield>syndrome</defaultsearchfield>
no change in solrconfig.xml
test_file_result_id id of 10 digits. , syndrome field stores blob contain huge data )kind of log file content).
i mention when search test_file_result_id, search results comes within second syndrome, take more minute.
thanks in advance!!
i assuming string
defined solr.strfield
in schema.xml
.
since having blob of data, possibly useful use field type has right set of tokenizers, analyzers , filters.
for example, adding standardtokenizerfactory keeps tokens meaningful value set.
an example of fieldtype definition:
<fieldtype name="text_general" class="solr.textfield" positionincrementgap="100" omitnorms="true"> <analyzer type="index"> <tokenizer class="solr.standardtokenizerfactory" /> <filter class="solr.stopfilterfactory" ignorecase="true" words="stopwords.txt" enablepositionincrements="true" /> <filter class="solr.lowercasefilterfactory" /> </analyzer> </fieldtype>
you try , should make difference response time.
Comments
Post a Comment