Print

This page describes the generic text segmentation rules that should be used when preparing a new corpus to be annotated.

General remarks:

Generic rules for segmentation into sentences:

Generic rules for segmentation into tokens:

Known problems: