Semantic preserving text representation and its applications in text clustering