Database of Lithuanian Nominal Phrases Dictionary is the first corpus-based dictionary of Lithuanian multi-word units.
The lexicon reflects a big variety of multi-word units: collocations, phrasal combinations, longer pieces of text (sometimes even several sentences). The identification of the nominal phrases is based on gravity counts in the 100m words Contemporary Corpus of Lithuanian Languages. The corpus reflects the written Lithuanian in the period of 1991-2002. The list of automatically identified MWU‘s was edited by linguists, only leaving meaningful and grammatically correct phrases that include at least one noun. The dictionary comprises almost 69 thousand nominal phrases.