jieba最早是以Java所撰寫的開源程式[4]。在斷詞時會先建立字典

jieba最早是以Java所撰寫的開源程式[4]。在斷詞時會先建立字典樹(Trie)，以句子中每個字為字首，將所有延伸的詞彙找出，接著對句子生成Directed Acyclic Graph, DAG，根據辭典中詞彙的出現頻率，以動態規劃方法找出最大切分組合，屬於全切分方法，並使用隱藏式馬可夫模型來判斷未知詞，從其他可觀測資料來計算未知資料的機率，也就是說，可以透過字的序列狀態去推測句子的結構，將每種結構的排列組合都做比較，計算出最符合的斷詞位置，找出最佳的斷詞結構和未知詞。

0/5000

原始語言: -

目標語言: -

結果 (英文) 1: [復制]

復制成功！

jieba is the earliest written Java open-source program [4]. When hyphenation will first establish trie (Trie), each word in the sentence prefix, extending all the vocabulary to find, then the sentence generating Directed Acyclic Graph, DAG, according to the frequency of occurrence of dictionary words to dynamic programming method to find the greatest combination of segmentation, are full segmentation method, and the use of hidden Markov models to determine the unknown word, to calculate the probability of an unknown data from other observations, that is, you can go through the sequence of the status word We speculated that the sentence structure, the permutations and combinations of each structure are compared to calculate the off position best suits the word, to find the best hyphenation structure and unknown words.

正在翻譯中..

結果 (英文) 2:[復制]

復制成功！

Jieba was first written as an open source program in Java. A dictionary tree (Trie) is established when word breaks are established, with each word in the sentence prefixed, all extended words are identified, and then directed Acyclic Graph, DAG, based on the frequency of the appearance of words in the dictionary, to find out the maximum slicing combination in a dynamic planning method, which is a full-cut method. The hidden Markov model is used to judge unknown words and calculate the probability of unknown data from other observable data, that is to say, the structure of sentences can be inferred from the sequence state of words, the arrangement combinationofs of each structure can be compared, the most consistent word-breaking position, to find out the best word-breaking structure and unknown words.

正在翻譯中..

結果 (英文) 3:[復制]

復制成功！

Jieba is the first open source program written in Java [4]. When breaking words, it will first establish a trie, prefix each word in the sentence, find out all the extended words, and then generate a directed acyclic graph for the sentence, DAG, according to the occurrence frequency of words in dictionaries, uses dynamic programming method to find the maximum segmentation combination, which belongs to the full segmentation method, and uses hidden Markov model to judge unknown words, and calculates the probability of unknown data from other observable data. That is to say, the sentence structure can be inferred through the sequence state of words, and the arrangement and combination of each structure can be compared Find out the most suitable word break position, find out the best word break structure and unknown word.<br>

正在翻譯中..

其它語言

本翻譯工具支援: 世界語, 中文, 丹麥文, 亞塞拜然文, 亞美尼亞文, 伊博文, 俄文, 保加利亞文, 信德文, 偵測語言, 優魯巴文, 克林貢語, 克羅埃西亞文, 冰島文, 加泰羅尼亞文, 加里西亞文, 匈牙利文, 南非柯薩文, 南非祖魯文, 卡納達文, 印尼巽他文, 印尼文, 印度古哈拉地文, 印度文, 吉爾吉斯文, 哈薩克文, 喬治亞文, 土庫曼文, 土耳其文, 塔吉克文, 塞爾維亞文, 夏威夷文, 奇切瓦文, 威爾斯文, 孟加拉文, 宿霧文, 寮文, 尼泊爾文, 巴斯克文, 布爾文, 希伯來文, 希臘文, 帕施圖文, 庫德文, 弗利然文, 德文, 意第緒文, 愛沙尼亞文, 愛爾蘭文, 拉丁文, 拉脫維亞文, 挪威文, 捷克文, 斯洛伐克文, 斯洛維尼亞文, 斯瓦希里文, 旁遮普文, 日文, 歐利亞文 (奧里雅文), 毛利文, 法文, 波士尼亞文, 波斯文, 波蘭文, 泰文, 泰盧固文, 泰米爾文, 海地克里奧文, 烏克蘭文, 烏爾都文, 烏茲別克文, 爪哇文, 瑞典文, 瑟索托文, 白俄羅斯文, 盧安達文, 盧森堡文, 科西嘉文, 立陶宛文, 索馬里文, 紹納文, 維吾爾文, 緬甸文, 繁體中文, 羅馬尼亞文, 義大利文, 芬蘭文, 苗文, 英文, 荷蘭文, 菲律賓文, 葡萄牙文, 蒙古文, 薩摩亞文, 蘇格蘭的蓋爾文, 西班牙文, 豪沙文, 越南文, 錫蘭文, 阿姆哈拉文, 阿拉伯文, 阿爾巴尼亞文, 韃靼文, 韓文, 馬來文, 馬其頓文, 馬拉加斯文, 馬拉地文, 馬拉雅拉姆文, 馬耳他文, 高棉文, 等語言的翻譯.