Apache Nutch 2.x: no errors, but no results either
I've been playing with Nutch 2.x for a while and have set it up according to the Nutch 2.x tutorial, as advised in this post, but I still can't figure this out. Any help is appreciated.
When I use the inject command as per the tutorial, it injects the 2 URLs I have in seeds.txt:
nutch inject ../local/urls/seed.txt
But when I run the crawl script, it doesn't visit any of the URLs:
bin/crawl ../local/urls/seed.txt testcrawl http://localhost:8983/solr 2
I've since started again with a completely new install of Nutch 2.2.1, HBase 0.94.10 and Solr 4.4.0, as advised on the mailing list because the versions mentioned in the tutorial are years old, and the error I'm getting now is:
[root@localhost local]# bin/nutch inject /urls/seed.txt
InjectorJob: starting at 2013-08-11 17:59:32
InjectorJob: Injecting urlDir: /urls/seed.txt
InjectorJob: org.apache.gora.util.GoraException: java.lang.RuntimeException: java.lang.IllegalArgumentException: Not a host:port pair: �2249@localhost.localdomainlocalhost,45431,1376235201648