DiSeg
A
Discourse Segmenter for Spanish
DiSeg is
the first discourse segmenter for
Spanish using the framework of the Rhetorical Structure Theory
(Mann
and Thompson, 1988) and based on lexical and syntactic
rules. The system can be tested here.
One
of the best ways to evaluate a discourse segmenter is comparing its
results with
the results of other similar available systems. However, we have
developed the first discourse segmenter for Spanish, so we cannot use
another system for its evaluation. We have carried a gold
standard in order to encourage
other researchers to go on investigating in this field. You can consult
the original texts and the discourse text segmentations into the
following table. The segmentations are xml files.