java - impl.ConcurrentUpdateSolrServer: Status for: {file-path}is 404 -
i want index corpus using solr.
create sequence file, used following command:
./behemoth -i file://path/to/my/file/where/the corpus/is/located -o /user/user-name/file-to-which-the-putput-is-stored
after gave following command indexing:
./behemoth solr /user/user-name/pth-to-which-output-is-stored-in-previous-command http://localhost:8983/solr
but giving following error:
15/06/04 11:51:07 info mapreduce.job: job job_local183059797_0001 running in uber mode : false 15/06/04 11:51:07 info mapreduce.job: map 0% reduce 0% 15/06/04 11:51:08 info mapred.localjobrunner: 15/06/04 11:51:08 info impl.concurrentupdatesolrserver: status for: file:///usr/local/asr/data/corpus/en_thetelegraph_2001-2010/telegraph_2007-2010/telegraph_1st_oct_2007_to_31st_dec_2007/foreign/1071015_foreign_story_8435523.utf8 404 15/06/04 11:51:08 error impl.concurrentupdatesolrserver: error java.lang.exception: not found
i unable figure out issue above mentioned file exists on path. please help
just found question, best ask on digitalpebble mailing list or open issue on github.
i don't think problem related content of input. looks more can't connect solr.
also you've imported corpus of documents no text or metadata have been extracted part of import. should run tika module on input first.
Comments
Post a Comment