Clustering-Based Pre-Processing Approaches To Improve Similarity Join Techniques