Vietnamese NLP

Word Segmentation

Name                   Language  Description
JVnSegmenter           Java      A Java-based Vietnamese word segmentation tool
DongDu                 C++       A Vietnamese word segmentation tool
Online tool from VLSP  online

Part-of-Speech Tagging

Name      Language  Description
vnTagger  Java      Vietnamese part-of-speech tagging

Corpus

Name                Description
VNESEcorpus         650,000 sentences; 10,000 articles from vietnamnet.vn, dantri.com.vn, nhanhdan.com.vn; 64.59 MB
VNTQcorpus (small)  300,000 sentences; 1,000 articles from vnthuquan.net; ~35 MB
VNTQcorpus (big)    1,750,000 sentences; 13,000 articles from vnthuquan.net; ~240 MB

Named Entity Recognition

Unknown

Relation Extraction Systems

Unknown
