Word-length algorithm for language identification of under-resourced languages