Text this: Dimension reduction and clustering of high dimensional data using auto-associative neural networks