NLPub — catalogue of linguistic solutions

ноября 23, 2017

I want to introduce NLPub a small knowledge base, devoted to computational linguistics in Russia.

Now no surprise the devices and applications that can understand and speak human language. At the heart of such applications are the methods of natural language processing, forming the General direction at the intersection of linguistics and artificial intelligence.

Why the vast majority of devices, applications and services not working with Russian language?

I often have to repeat this, but the reason is simple and tragic. The fact that the decision of tasks of natural language processing involves the use of specialized programs analyzers, which are in dire need of information resources — dictionaries, thesauruses, thanks to which they are able to perform its function.

All of this in Russia that paralyzes the commercial enterprises and academic teams, forcing to reinvent the wheel, or simply abandon linguistic technologies.

The most useful thing you can do momentary is to help people to quickly adapt and soon begin to use the few available technologies that we have at the moment.

You need to make a catalogue of available software functionality, write training materials, provide links to data, guidance and other information resources. For this reason I have created NLPub and invite everyone to join in its development.

What information is collected within NLPub?

Special attention is paid to the following topics:

the

text processing tools, available for both commercial and non-commercial use — tokenization, morphological analysers, syntactic parsers, tools sentiment analysis;
resources dictionaries, thesauruses, corpora, required to solve fundamental and applied tasks;
activities — themed conferences and workshops for researchers and developers;
education — educational institutions and professional courses in the field of natural language processing and data analysis.

How can we help the project?

I see three methods available:

the

to replenish the knowledge base, providing readers a quality, correct and up to date material about the situation in the domestic computational linguistics;
improved made in the design and development of a knowledge base;
tell about NLPub in different thematic communities, increasing public interest in the field of natural language processing (at least in the blog about it, write about how I did).

Who it belongs to?

NLPub is a non-profit project and has no affiliation with commercial companies. It is in any case not close the way for commercial companies. On the contrary, posting information about their products is highly welcomed along with open and free solutions. Today in the list of instrumento you can find a lot of commercial products.

I will gladly answer all your questions and comments as in the comments here or via private channels.

Article based on information from habrahabr.ru

Поиск по этому блогу

computer express