Tag Cloud & Text Classification

With so many texts, books, news stories, papers, and scientific articles published in magazines and across the internet, it is hard to classify them by content. We therefore need an application that, given a set of articles, displays a tag cloud of the most popular words in that set. Different forms of the same word (the same “lexeme”) should be counted together; also consider setting a minimum number of words for the cloud. Consider different ways of visualizing the cloud: sphere-shaped, ordered by popularity, or with the words in different orientations. The application should also display relationships between words that appear in the same sentence, in the same paragraph, or near one another multiple times. It should also be possible to view the sentence a word originally belonged to.
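As a starting point, the counting side of such an application could be sketched as below. This is only a minimal illustration, not a prescribed design: the function names, the tiny stop-word list, the plural-stripping `normalize` (a crude stand-in for real lemmatization), and the reading of the "minimum" setting as an occurrence threshold (`min_count`) are all assumptions made for the example. It covers same-sentence relationships only, and "original sentence" is taken to mean the first sentence in which a word appears.

```python
import re
from collections import Counter
from itertools import combinations

# Hypothetical, deliberately tiny stop-word list; a real application needs a fuller one.
STOP_WORDS = {"the", "a", "an", "and", "or", "of", "in", "on", "to", "is", "are", "it"}


def split_sentences(text):
    """Naive sentence split: break after '.', '!' or '?' followed by whitespace."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def normalize(word):
    """Crude stand-in for lemmatization: lowercase and strip a plural 's'."""
    word = word.lower()
    if len(word) > 3 and word.endswith("s") and not word.endswith("ss"):
        word = word[:-1]
    return word


def tokenize(sentence):
    """Return normalized words of a sentence, skipping stop words."""
    return [normalize(w) for w in re.findall(r"[A-Za-z]+", sentence)
            if w.lower() not in STOP_WORDS]


def build_tag_data(text, min_count=2):
    """Return (tags, pairs, first_sentence) for the given text.

    tags:           word -> count, for words seen at least min_count times
    pairs:          (word_a, word_b) -> number of sentences containing both
    first_sentence: word -> first sentence in which the word appears
    """
    freq = Counter()
    pairs = Counter()
    first_sentence = {}
    for sentence in split_sentences(text):
        tokens = tokenize(sentence)
        freq.update(tokens)
        for token in tokens:
            first_sentence.setdefault(token, sentence)
        # Count each unordered pair once per sentence, in alphabetical order.
        for a, b in combinations(sorted(set(tokens)), 2):
            pairs[(a, b)] += 1
    tags = {w: c for w, c in freq.items() if c >= min_count}
    return tags, pairs, first_sentence


def font_sizes(tags, min_pt=12, max_pt=48):
    """Map counts linearly onto a font-size range (one possible sizing rule)."""
    if not tags:
        return {}
    lo, hi = min(tags.values()), max(tags.values())
    span = (hi - lo) or 1  # avoid division by zero when all counts are equal
    return {w: min_pt + (c - lo) * (max_pt - min_pt) / span for w, c in tags.items()}
```

Calling `build_tag_data` on the combined text of the article set gives the data a renderer needs: `tags` and `font_sizes(tags)` drive the cloud itself, `pairs` can feed a relationship view, and `first_sentence` supports showing a word in context. Paragraph-level and proximity-based relationships would need additional passes over the text.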

Bonus: side-by-side generation and comparison of tag clouds or texts.

  • Requirements:
  • Programming Skill Level: Beginner to Intermediate
  • Designer Skill Level: Intermediate to Advanced
  • Other Skills:
