TermSuite is a Java UIMA-based toolbox for terminology extraction and multilingual term alignment.
Multiword and compound term detection, morphosyntactic analysis, term variant detection, term specificity computation, etc. See features
Current version of TermSuite is 2.2 See Changelog
Prepare your system for TermSuite, download, install and get it running on an example corpus quickly.
List of all TermSuite's features, analysis engines, and configuration parameters. Java API.
|POS Tagging (3rd party: with TreeTagger or Mate)|
|Lemmatization (3rd party: with TreeTagger or Mate)|
|Efficient multiword term detection|
|Term syntactic variants detection|
|Term graphic variants detection|
|Term semantic variants detection (to come in 3.0)|
|Term morphology extraction|
|Term morphosyntactic variants detection|
|Term specificity (Weirdness Ratio) computing and other term measures: WR log, term frequency, etc|
|Term alignment (distributional and compositional, multilingual and monolingual)|
|Terminology export in multiple formats: `json`, `tsv`, `tbx`|