close
Warning:
Can't synchronize with repository "(default)" (/usr/svn/silverfile does not appear to be a Subversion repository.). Look in the Trac log for more information.
- Timestamp:
-
Jun 26, 2010, 9:15:49 PM (13 years ago)
- Author:
-
hank
- Comment:
-
--
Legend:
- Unmodified
- Added
- Removed
- Modified
-
v14
|
v15
|
|
171 | 171 | |
172 | 172 | == Prepare for a Crawl == |
173 | | First create urls/files so that nutch knows what to crawl. This file can |
174 | | be anywhere, but I'll put it in the main crawl_data directory for now. |
175 | | {{{ |
176 | | mkdir /usr/wwwapps/crawl_data/urls/ |
177 | | }}} |
178 | | Create/edit file called '''files''' in urls directory. Add the following line |
| 173 | First create crawl_urls/files so that nutch knows what to crawl. This file can |
| 174 | be anywhere, |
| 175 | {{{ |
| 176 | mkdir /usr/wwwapps/crawl_urls/ |
| 177 | }}} |
| 178 | Create/edit file called '''files''' in crawl_urls directory. Add the following line |
179 | 179 | to search the /FILES/ directory: |
180 | 180 | {{{ |
… |
… |
|
189 | 189 | == Run a Crawl == |
190 | 190 | {{{ |
191 | | nutch crawl /usr/wwwapps/crawl_data/urls -dir /usr/wwwapps/crawl_data/ |
| 191 | nutch crawl /usr/wwwapps/crawl_urls -dir /usr/wwwapps/crawl_data/ |
192 | 192 | }}} |
193 | 193 | |