Object detection and representation method for surveillance video indexing