⭐ 欢迎来到虫虫下载站! | 📦 资源下载 📁 资源专辑 ℹ️ 关于我们
⭐ 虫虫下载站

📄 gfilter.pl

📁 namazu. 虽然是日语的,也适用于文件中单词索引后全文检索.
💻 PL
字号:
## -*- Perl -*-# $Id: gfilter.pl,v 1.1.2.1 2001/09/21 08:11:07 knok Exp $# Copyright (C) 1999 Satoru Takabayashi ,#     This is free software with ABSOLUTELY NO WARRANTY.##  This program is free software; you can redistribute it and/or modify#  it under the terms of the GNU General Public License as published by#  the Free Software Foundation; either versions 2, or (at your option)#  any later version.# #  This program is distributed in the hope that it will be useful#  but WITHOUT ANY WARRANTY; without even the implied warranty of#  MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the#  GNU General Public License for more details.##  You should have received a copy of the GNU General Public License#  along with this program; if not, write to the Free Software#  Foundation, Inc., 59 Temple Place - Suite 330, Boston, MA#  02111-1307, USA##  This file must be encoded in EUC-JP encoding#package gfilter;# Show debug information for filterssub show_filter_debug_info ($$$$) {    my ($contref, $weighted_str, $fields, $headings) = @_;    util::dprint("-- title --\n$fields->{'title'}\n") 	if defined $fields->{'title'};    util::dprint("-- content --\n$$contref\n");    util::dprint("-- weighted_str: --\n$$weighted_str\n");    util::dprint("-- headings --\n$$headings\n");}# Adjust white spacessub white_space_adjust_filter ($) {    my ($text) = @_;    $$text =~ s/[ \t]+/ /g;    $$text =~ s/\r\n/\n/g;    $$text =~ s/\r/\n/g;    $$text =~ s/\n+/\n/g;    $$text =~ s/^ +//gm;    $$text =~ s/ +$//gm;    $$text =~ s/ +/ /g;    # Control characters be into space    $$text =~ tr/\x00-\x09\x0b-\x1f/  /;}# get a title from a file name.sub filename_to_title ($$) {    my ($cfile, $weighted_str) = @_;    # for MSWin32's filename using Shift_JIS [1998-09-24]    if (($mknmz::SYSTEM eq "MSWin32") || ($mknmz::SYSTEM eq "os2")) {	$cfile = codeconv::shiftjis_to_eucjp($cfile);	$cfile = codeconv::eucjp_han2zen_kana($cfile);    }        $cfile =~ m!^.*/([^/]*)$!;    my $filename = $1;    # get keywords from a file name.    # modified [1998-09-18]     my $tmp = $filename;    $tmp =~ s|/\\_\.-| |g;    my $weight = $conf::Weight{'html'}->{'title'};    $$weighted_str .= "\x7f$weight\x7f$tmp\x7f/$weight\x7f\n";    my $title = $filename;    return $title}# Remove SPACE/TAB at the beginning or ending of the line.# And remove '>|#:' at the begenning of the line.# Join hyphenation for English text.# Remove LF if the line is ended with a Japanese character and # length of the line is 40 or more longer.## Original of this code was contributed by <furukawa@tcp-ip.or.jp>. # [1997-09-15]#sub line_adjust_filter ($) {    my ($text) = @_;    return undef unless defined($$text);    my @tmp = split(/\n/, $$text);    for my $line (@tmp) {	$line .= "\n";	$line =~ s/^[ \>\|\#\:]+//;	$line =~ s/ +$//;	$line =~ s/\n// if (($line =~ /[\xa1-\xfe]\n*$/) &&			    (length($line) >=40));	$line =~ s/(。|、)$/$1\n/;	$line =~ s/([a-z])-\n/$1/;  # for hyphenation.    }    $$text = join('', @tmp);}# not implimented yet.sub analize_rcs_stamp(){}1;

⌨️ 快捷键说明

复制代码 Ctrl + C
搜索代码 Ctrl + F
全屏模式 F11
切换主题 Ctrl + Shift + D
显示快捷键 ?
增大字号 Ctrl + =
减小字号 Ctrl + -