包括seqing、zhengzhi等多种类别敏感词库,共604KB大小。 可应用在网站中做敏感词过滤,从而净化网络,创造和谐社会。 压缩包种有10个txt文件,类别已分好。
2022-05-17 11:52:31 223KB 敏感词库
1
Sensitive-word-filtering 敏感词过滤 直接从Java版DFA算法实现,转移过来,并未作修改,词库直接从百度搜索的一个暂用。 可替换成其他词库,也可自行添加词条。 在项目init方法中调用ReadSwfDict加载词库 在检查点调用Match方验证,或调用Repl方法替换 具体使用见swf_test.go
2022-05-03 12:59:52 3KB Go
1
Sensitive敏感词过滤类,源代码分享(本人开发的代码,可以使用,但是禁止通过其他方式进行转载,谢谢)
2022-05-03 12:57:05 908KB Sensit 敏感词 敏感词过滤 源码
1
主要介绍了C#敏感词过滤实现方法,涉及C#针对字符串操作的常用技巧,非常具有实用价值,需要的朋友可以参考下
2022-04-10 18:33:37 59KB C# 敏感词过滤
1
Java实现DFA算法实现敏感词过滤,并且支持跳词,如中 国,中*国等,并包含了比较全的敏感词库,也摘抄了其它几种敏感词过滤方法,只对DFA方式的敏感词做了优化,支持跳词
2022-04-05 16:44:51 3.41MB dfa  敏感词 过滤 java
1
Author: Richard Zhang. Mail: 89205975@qq.com This library filters sensitive phrases by user's configuration. Currently, only support UTF8 & ANSI encoded strings. The matching rule is max-length-matching, the library tries to match sensitive phrase as long as possible. For example: "damn fucker" and "damn" are all in sensitive dictionary, the sentence "he's a damn fucker" will be processed to "he's a ***********". Even user insert some spaces or non-letter characters between sensitive words, the library is also able to deal with it. For example: "Bad boy" is added to sensitive dictionary, "Bad.boy", "Bad boy", "Bad/boy" can also be filtered. "你去死" is added to sensitive dictionary, "你 去 死", "你/去 死", "你 去 .死" can also be filtered. Compiling requirement: 1. STL C++11 2. BOOST multi_index_container Performance test condition: 1. Giving a sentence around 100 bytes (English & Chinese mixed) 2. Dirty phrases around 10,000 3. Do 1,000 loop test 4. Intel I7 CPU Test result: For each loop, it cost around 100us
2022-04-02 17:47:14 4KB 脏话 敏感词 聊天 过滤
1
java使用dfa算法实现敏感词过滤,此算法效率最高,附带了一个敏感词库,轻松搞定论坛网站的敏感词过滤问题。
2022-03-30 13:14:46 1.39MB 敏感词过滤 dfa Java
1
如下所示: #!/usr/bin/python2.6 # -*- coding: utf-8 -*- import time class Node(object): def __init__(self): self.children = None # The encode of word is UTF-8 def add_word(root,word): node = root for i in range(len(word)): if node.children == None: node.children = {} node.chil
2022-03-01 10:49:08 52KB python python算法 敏感词
1
一款高质量的敏感词过滤器 , 采用MFC编写 , 注释详细 , 算法简洁易懂 . 非常适合学习MFC或字符串匹配的同学.
2022-02-19 11:21:42 2.17MB mfc 字符串匹配 敏感词 过滤
1
1.企微会话存档数据官方解析处理流程,多线程同步,保证速度和效率 2.注释清晰,开箱即用 3.默认实时记录seq队列值,增量运行 4.动态同步指定范围数据 5.cos文件上传 6.es数据存储 7.敏感词过滤