Statistics for "Using language similarities in retrieval for resource scarce languages: a study of several southern Bantu languages"