Text this: Cluster-based compound selection using fuzzy clustering