LinguaStream
LinguaStream is a generic platform for natural language processing (NLP), based on the incremental enrichment of electronic documents. It has been developed at the GREYC computer science research group (Université de Caen) since 2001, and is available free of charge for private use and research purposes.
Description
LinguaStream allows complex processing streams to be designed and evaluated, assembling analysis components of various types and levels: part-of-speech, syntax, semantics, discourse or statistical. Each stage of the processing stream discovers and produces new information, on which the subsequent stages can rely. At the end of the stream, several tools allow analysed documents and their annotations to be conveniently visualised.
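The idea of incremental enrichment can be illustrated with a minimal Java sketch. It does not use the LinguaStream API: the names `Document`, `Annotation`, `Stage`, `TOKENIZER` and `CAPITAL_TAGGER` are hypothetical and chosen only for illustration. It assumes a document is a fixed text plus a growing list of annotations, and that each stage may read what earlier stages produced.

```java
import java.util.ArrayList;
import java.util.List;

public class EnrichmentStreamSketch {

    /** One annotation over a character span of the document text (hypothetical model). */
    static final class Annotation {
        final String type;
        final int start;
        final int end;
        final String value;

        Annotation(String type, int start, int end, String value) {
            this.type = type;
            this.start = start;
            this.end = end;
            this.value = value;
        }

        @Override
        public String toString() {
            return type + "[" + start + "," + end + "]=" + value;
        }
    }

    /** A document: fixed text plus annotations that accumulate along the stream. */
    static final class Document {
        final String text;
        final List<Annotation> annotations = new ArrayList<>();

        Document(String text) {
            this.text = text;
        }

        /** Returns a copy of the annotations of the given type. */
        List<Annotation> ofType(String type) {
            List<Annotation> out = new ArrayList<>();
            for (Annotation a : annotations) {
                if (a.type.equals(type)) {
                    out.add(a);
                }
            }
            return out;
        }
    }

    /** A processing stage reads existing annotations and adds new ones. */
    interface Stage {
        void process(Document doc);
    }

    /** Stage 1: marks whitespace-separated tokens. */
    static final Stage TOKENIZER = doc -> {
        int i = 0;
        int n = doc.text.length();
        while (i < n) {
            while (i < n && Character.isWhitespace(doc.text.charAt(i))) i++;
            int start = i;
            while (i < n && !Character.isWhitespace(doc.text.charAt(i))) i++;
            if (i > start) {
                doc.annotations.add(new Annotation("token", start, i, doc.text.substring(start, i)));
            }
        }
    };

    /** Stage 2: a toy tagger that relies on the tokens produced by stage 1. */
    static final Stage CAPITAL_TAGGER = doc -> {
        for (Annotation t : doc.ofType("token")) {
            if (Character.isUpperCase(t.value.charAt(0))) {
                doc.annotations.add(new Annotation("capitalised", t.start, t.end, t.value));
            }
        }
    };

    /** Runs the stages in order over a fresh document and returns the enriched result. */
    static Document run(String text, List<Stage> stream) {
        Document doc = new Document(text);
        for (Stage stage : stream) {
            stage.process(doc);
        }
        return doc;
    }

    public static void main(String[] args) {
        Document doc = run("LinguaStream enriches documents in Caen",
                List.of(TOKENIZER, CAPITAL_TAGGER));
        for (Annotation a : doc.annotations) {
            System.out.println(a);
        }
    }
}
```

In this sketch the order of the stages matters: if `CAPITAL_TAGGER` ran before `TOKENIZER`, it would find no tokens and add nothing, which mirrors the dependency of later stages on earlier ones described above.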
LinguaStream is above all a virtual laboratory aimed at NLP researchers. It allows complex experiments on corpora to be carried out conveniently, using various types of declarative formalisms, and considerably reduces development costs. Its uses range from corpus exploration to the development of fully functional automatic analysers. The platform comes with an integrated environment in which every step of an experiment can be carried out.
Technology
As a platform, LinguaStream provides an extensive Java API. For example, it can be integrated with Java EE servers to develop web applications based on processing streams. It is also used for teaching, and provides specific modules dedicated to students.