天天看點

Nutch2.3+Hbase0.94環境搭建

1,修改nutch-site.xml  <property>      <name>storage.data.store.class</name>      <value>org.apache.gora.hbase.store.HBaseStore</value>      <description>Default class for storing data</description>     </property>     <property>      <name>http.agent.name</name>      <value>JustinNutchAgent</value>     </property>  <property>   <name>plugin.includes</name>   <value>protocol-httpclient|urlfilter-regex|index-(basic|more)|query-(basic|site|url|lang)|indexer-solr|nutch-extensionpoints|protocol-httpclient|urlfilter-regex|parse-(text|html|msexcel|msword|mspowerpoint|pdf)|summary-basic|scoring-opic|urlnormalizer-(pass|regex|basic)protocol-http|urlfilter-regex|parse-(html|tika|metatags)|index-(basic|anchor|more|metadata)</value>  </property> 2,修改ivy.xml中包含org.apache.hadoop的對應的hadoop對應版本,我的對應版本為1.2.1     <dependency org="org.apache.gora" name="gora-hbase" rev="0.5" conf="*->default" />     <dependency org="org.apache.gora" name="gora-core" rev="0.5" conf="*->default"/>     <dependency org="org.apache.gora" name="gora-compiler-cli" rev="0.5" conf="*->default"/>     <dependency org="org.apache.gora" name="gora-compiler" rev="0.5" conf="*->default"/> 3,在gora.properties中增加             gora.datastore.default=org.apache.gora.hbase.store.HBaseStore 4,修改build.xml修改hadoop-*test*.jar改為hadoop-*.jar

繼續閱讀