Beyond Verbal Multiword Expressions (VMWEs)

 

This initiative should be seen as a follow-up of the activities involved in the preparation of the shared task within PARSEME.

The main aim is to enrich the corpora that are going to be annotated with VMWEs for the shared tasks with annotation of MWEs of other parts of speech (nominal, adjectival, adverbial).

What we build on:

  • already preprocessed corpora for each language;

  • already tested instruments for annotation (FLAT);

  • already existing network of interested people;

  • experienced researchers in writing guidelines and in annotating texts.

In order to achieve our final goal, we need to invest enthusiasm, time and energy in the following activities:

  • write guidelines for the annotation of such MWEs, in the same style as for the VMWEs (not English-centered, clear enough for the annotators to choose easily among categories) ‚Äì this will involve language leaders;

  • annotation of the multilingual corpora by native speakers (preferably) according to the guidelines;

  • coordinate a language team, as well as a language leaders team;

  • find funding sources, if possible: national and international levels ‚Äì everybody will be involved; think of: summer camps, bilateral projects, consortium projects, etc.

The foreseen results are:

  • annotation guidelines for nominal, adjectival, adverbial MWEs;

  • multilingual corpora annotated with MWEs;

  • open access to these resources for the community.

Contact person: Verginica Mititelu, This email address is being protected from spambots. You need JavaScript enabled to view it.