

浏览全部资源
扫码关注微信
1. 新疆大学信息科学与工程学院
2. 新疆多语种信息技术重点实验室
Published:2011
移动端阅览
[1]王健,哈力木拉提·买买提.印刷体维吾尔文识别后处理[J].新疆大学学报(自然科学版),2011,28(02):248-252.
王健, 哈力木拉提·买买提. 印刷体维吾尔文识别后处理[J]. Journal of Xinjiang University (Natural Science Edition in Chinese and English), 2011, 28(2): 248-252.
本文主要讨论将N-gram模型与编辑距离算法运用于印刷体维吾尔文识别后处理.由于印刷体维吾尔文识别系统的识别错误有一定规律性
所以研究中对识别错误进行了比较、分析、分类、并在编辑距离算法中加入识别错误的权值
以提高识别的正确率.最后
通过实验证明本算法能有效提高识别的正确率.
In this paper
we apply the N-gram model and the algorithm of Levenshtein Distance to Printed Uygur character recognition post-processing.The recognition errors of the system of Printed Uygur character recognition is a regular pattern
by setting weigh of the recognition errors in the algorithm of Levenshtein Distance based on the comparison and analysis and class of the recognition errors
the correct rate of the recognition were improved.Finally
the results of the experiments indicate that the method can definitely increase the correct rate of the recognition.
福克尔.米勒(Volker Muller).用于统计调查的文字识别后处理方法[J].模式识别与人工智能,1992,5(2):129-133.
邢永康,马少平.统计语言模型综述[J].计算机科学,2003,30(9):22-26.
LEVENSHTEIN V L.Binary codes capable of correcting deletions,insertions and reversals[J].Doklady Akademii NaukSSSR,1966,163(4):707-710.
LOWRANCE R,WAGNER R A.An extension of the string-to-string correction problem[J].Journal of the ACM,1975,22(2):177-183.
董广宇,吕学强,等.基于Ngram语言模型的汉字识别后处理研究[J].微计算机信息,2009,25(10):276-278.
赵作鹏,尹志民,等.一种改进的编辑距离算法及其在数据处理中的应用[J].计算机应用,2009,29(2):424-426.
0
Views
114
下载量
0
CSCD
Publicity Resources
Related Articles
Related Author
Related Institution
京公网安备11010802024621