DiSeg is the first discourse segmenter for Spanish. It works within the
framework of Rhetorical Structure Theory (Mann and Thompson, 1988)
and is based on lexical and syntactic rules.

To test it, you can use this demo
(enter your text in Spanish with UTF-8 encoding):


You can also try DiSeg on these corpora. The source code is available under the GPL here as a shell/Perl script that runs on Linux and relies on FreeLing. An online API is also available; contact eric.sanjuan@univ-avignon.fr for details.

If you like DiSeg, please cite us as follows:

