whitespacetokenizer.cs

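From "An introduction to full-text search libraries, using Lucene as the example, implementing full-text search over multiple document types in a .NET environment" · C# code · 42 lines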

/*
 * Licensed to the Apache Software Foundation (ASF) under one or more
 * contributor license agreements.  See the NOTICE file distributed with
 * this work for additional information regarding copyright ownership.
 * The ASF licenses this file to You under the Apache License, Version 2.0
 * (the "License"); you may not use this file except in compliance with
 * the License.  You may obtain a copy of the License at
 * 
 * http://www.apache.org/licenses/LICENSE-2.0
 * 
 * Unless required by applicable law or agreed to in writing, software
 * distributed under the License is distributed on an "AS IS" BASIS,
 * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
 * See the License for the specific language governing permissions and
 * limitations under the License.
 */

using System;

namespace Lucene.Net.Analysis
{
	
    /// <summary>A WhitespaceTokenizer is a tokenizer that divides text at whitespace.
    /// Adjacent sequences of non-whitespace characters form tokens.
    /// </summary>
	
    public class WhitespaceTokenizer : CharTokenizer
    {
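        /// <summary>Constructs a new WhitespaceTokenizer that reads from the given reader.</summary>
        /// <param name="in_Renamed">The reader supplying the text to tokenize.</param>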
        public WhitespaceTokenizer(System.IO.TextReader in_Renamed) : base(in_Renamed)
        {
        }
		
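        /// <summary>Collects only characters which do not satisfy
        /// <see cref="System.Char.IsWhiteSpace(char)"/>.
        /// </summary>
        /// <returns>true if <paramref name="c"/> belongs to a token, false if it is whitespace.</returns>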
        protected internal override bool IsTokenChar(char c)
        {
            return !System.Char.IsWhiteSpace(c);
        }
    }
}
