java - impl.ConcurrentUpdateSolrServer: Status for: {file-path} is 404


I want to index a corpus using Solr.
To create the sequence file, I used the following command:

./behemoth -i file://path/to/my/file/where/the corpus/is/located -o /user/user-name/file-to-which-the-putput-is-stored 

After that, I gave the following command for indexing:
./behemoth solr /user/user-name/pth-to-which-output-is-stored-in-previous-command http://localhost:8983/solr

But it gives the following error:

15/06/04 11:51:07 info mapreduce.job: job job_local183059797_0001 running in uber mode : false
15/06/04 11:51:07 info mapreduce.job:  map 0% reduce 0%
15/06/04 11:51:08 info mapred.localjobrunner:
15/06/04 11:51:08 info impl.concurrentupdatesolrserver: status for: file:///usr/local/asr/data/corpus/en_thetelegraph_2001-2010/telegraph_2007-2010/telegraph_1st_oct_2007_to_31st_dec_2007/foreign/1071015_foreign_story_8435523.utf8 404
15/06/04 11:51:08 error impl.concurrentupdatesolrserver: error java.lang.exception: not found

I am unable to figure out the issue, since the file mentioned above exists at that path. Please help.

I just found this question. It is best to ask on the DigitalPebble mailing list or open an issue on GitHub.

I don't think the problem is related to the content of the input. It looks more like it can't connect to Solr.

Also, you've imported a corpus of documents, but no text or metadata have been extracted as part of the import. You should run the Tika module on the input first.
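To test the Solr URL independently of the corpus, a small probe like the one below may help. It is only a sketch: the class name `SolrUrlCheck` and its helper methods are made up for illustration, and it assumes the indexer sends documents to `/update` under the URL given on the command line. The hint about a missing core name is a guess at a common cause, not something confirmed by the log.

```java
import java.io.IOException;
import java.net.HttpURLConnection;
import java.net.URI;

// Hypothetical helper, not part of Behemoth or SolrJ.
public class SolrUrlCheck {

    // Builds the assumed update endpoint from a base URL, tolerating a trailing slash.
    static String updateUrl(String baseUrl) {
        String trimmed = baseUrl.endsWith("/")
                ? baseUrl.substring(0, baseUrl.length() - 1)
                : baseUrl;
        return trimmed + "/update";
    }

    // Turns an HTTP status code into a short hint.
    static String describe(int status) {
        if (status == 404) {
            return "reachable, but path not found (check the URL, e.g. a missing core name)";
        }
        return "reachable, HTTP " + status;
    }

    // Sends a plain GET and returns the status code; throws if no connection can be made.
    static int probe(String url) throws IOException {
        HttpURLConnection conn =
                (HttpURLConnection) URI.create(url).toURL().openConnection();
        conn.setRequestMethod("GET");
        conn.setConnectTimeout(5000);
        conn.setReadTimeout(5000);
        try {
            return conn.getResponseCode();
        } finally {
            conn.disconnect();
        }
    }

    public static void main(String[] args) {
        // Defaults to the URL from the question; pass another one as the first argument.
        String base = args.length > 0 ? args[0] : "http://localhost:8983/solr";
        String url = updateUrl(base);
        try {
            System.out.println(url + " -> " + describe(probe(url)));
        } catch (IOException e) {
            System.out.println(url + " -> cannot connect: " + e.getMessage());
        }
    }
}
```

If this reports "cannot connect", Solr is not listening at that address. If it reports a 404, the server answered but has nothing mapped at that path, so the URL passed to the indexing command is worth double-checking.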

