strigmetric是Scala的字符串相似性度量算法的库。(如:Dice/Sorese,Hammig,Jaccard,Jaro,Jaro-Wikler,Leveshtei,Metaphoe,N-Gram,NYSIIS,Overlap,Ratcliff/Obershelp,RefiedNYSIIS,RefiedSoudex,Soudex,WeightedLeveshtei)
这个库提供了一些工具来进行字符串相似度匹配,用来测量字符串的相似性与距离,通过单词的发音和声音的相似性比较来索引,除核心库之外,每个度量和算法都有一个命令行界面。
要求:Scala2.10+
文档:Scaladoc
问题:Ehacemets, Questios, Bugs
版本:SematicVersioigv2.0
依赖SBT:
libraryDepedecies += "com.rockymadde.strigmetric" %% "strigmetric-core" % "0.27.4"Gradle:
compile 'com.rockymadde.strigmetric:strigmetric-core_2.10:0.27.4'Mave:
<depedecy> <groupId>com.rockymadde.strigmetric</groupId> <artifactId>strigmetric-core_2.10</artifactId> <versio>0.27.4</versio></depedecy>创建CLIs$ git cloe https://github.com/rockymadde/strigmetric.git$ cd strigmetric$ sbt clea package$ ./project/build.sh$ ./target/cli/jarometric abc xyz使用CLIs获取帮助
$ metaphoemetric --helpCompares two strigs to determie if they are phoetically similarly, per the Metaphoe algorithm.Sytax: metaphoemetric [Optios] strig1 strig2...Optios: -h, --help Outputs descriptio, sytax, ad optios.获取度量的比较值
$ jarowiklermetric dog dawg0.75获取差异算法的代表值
$ metaphoealgorithm dogtk
评论