Statistics for Rita+ : an SGML based document processing system