📄 hitcontent.java

📁 Improved tools for Nutch search and related tools for optimizing the crawler
/* Copyright (c) 2003 The Nutch Organization.  All rights reserved.   */
/* Use subject to the conditions in http://www.nutch.org/LICENSE.txt. */

package net.nutch.searcher;

import java.io.IOException;

import net.nutch.parse.ParseData;
import net.nutch.parse.ParseText;

/** Service that returns the content of a hit. */
public interface HitContent {
  /** Returns the content of a hit document. */
  byte[] getContent(HitDetails details) throws IOException;

  /** Returns the ParseData of a hit document. */
  ParseData getParseData(HitDetails details) throws IOException;

  /** Returns the ParseText of a hit document. */
  ParseText getParseText(HitDetails details) throws IOException;

  /** Returns the anchors of a hit document. */
  String[] getAnchors(HitDetails details) throws IOException;

  /** Returns the fetch date of a hit document. */
  long getFetchDate(HitDetails details) throws IOException;

}
