Text this: Multimodal fusion: Gesture and speech input in augmented reality environment