Menu
  • HOME
  • TAGS

Solr DataImportHandler Cache Support for Multiple Values

caching,solr,dataimporthandler

Problem is fixed when the where part of the cached query is removed. I'm not sure the following configuration is ideal but what I understand is the aim is reducing the count of queries. <entity name="product" query="SELECT product_id FROM Product WHERE 1"> <entity name="strength" query="SELECT * FROM Strength WHERE product_id...

DataImportHandler DIH for FileSystem DataSource

solr,datasource,dataimporthandler,dih,lucidworks

Thanks, I did it, and below is the needed dataconfig <?xml version="1.0" encoding="UTF-8"?> <dataConfig> <dataSource type="BinFileDataSource" /> <document> <entity name="pdf" processor="FileListEntityProcessor" baseDir="/path/to/my/pdf" fileName=".*pdf" newerThan="'NOW-3DAYS'" recursive="true" rootEntity="false" dataSource="null"> <field column="fileAbsolutePath" name="id" /> <entity name="documentImport" processor="TikaEntityProcessor" url="${pdf.fileAbsolutePath}" format="text">...

SOLR RSS dataimport handler is not working

solr,dataimporthandler

I solved this problem. Problem was just in xml file formate. when I change my xml file and some configuration it worked for me. Here is my config file. <dataConfig> <dataSource type="URLDataSource" /> <document> <entity name="slashdot" pk="link" url="http://www.feedforall.com/sample.xml" processor="XPathEntityProcessor" forEach="/rss/channel/item" transformer="DateFormatTransformer"> <field column="source" xpath="/rss/channel/title" commonField="true" /> <field column="source-link" xpath="/rss/channel/link"...

Tika - url / file path issue

solr,solr4,dataimporthandler,apache-tika

You can use the RegexTransformer of SolR : http://wiki.apache.org/solr/DataImportHandler#RegexTransformer add to your transformer attribute the RegexTransformer : <entity name="db_link" query="SELECT ..." ... transformer="... ,org.apache.solr.handler.dataimport.RegexTransformer"...> modify field tag in row 'link' : <field column="link" regex="^([^#]+)#" sourceColName="LINK"/> That should be all EDIT regex corrected...

Solr date variable resolver is not working with MySql

mysql,oracle,solr,dataimporthandler

I checked out the source code for solr and tried to solve my issue. I had a fix for it and its working for me. The variable resolve in case of date is somehow making a array and so it appends the '[?, '28/05/2015 11:13:50']'. In the TemplateString.java in method...

Subentity SolrEntityProcessor stops working since SolR 5.x

solr,dataimporthandler,solr5

This is a known bug in Solr. Even I have come across! I posted this as an answer because, this is a bug and there's no solution from the author! We have actually downgraded the version in order to get rid of this bug. I am not sure if this...

Spring data solr - Full Data import via DIH

solr,dataimporthandler,spring-data-solr

The missing piece is setting a ReponseParser that understands the way Solr DIH answers the request. Adding XMLResponseParser should do the trick. template.execute(new SolrCallback<SolrResponse>() { @Override public SolrResponse doInSolr(SolrServer solrServer) throws SolrServerException, IOException { SolrRequest request = new SolrRequest(METHOD.GET, "/dataimport?command=status") { @Override public SolrResponse process(SolrServer server) throws SolrServerException, IOException {...

Data import in solr from multiple entities

solr,dataimporthandler,data-import

Finally got a solution. We need to modify data-config.xml file, under which there would be 3 different entities. 2 from MySQL and 1 from Solr core itself. For MySQL, entity should be something like this <entity name="entity name" dataSource="data source created in the file" query="SQL query to retrieve the data"...