Statistics for Semi-automatic matching of semi-structured data updates