Word embeddings: from vectors to mixtures of Gaussians