Online Tools English Text Math Webmaster Other

Online text analysis tool algorithm

Allows you to find the most frequent phrases and frequencies of words. Non-English language texts are supported. - we use some variant of "shingling" algorithm
- instead of words we handle with hash values
- each word get its hash value
- for phrases we group hash values of words with << operator, hash(phrase)=hash(word1)<<7+hash(word2)<<14+....
- we create several HashTables with (hashvalue, occurencies) pair, read all hashvalues from the text and update occurencies
- extract data from these HashTables, find some hashvalues with most occurencies, and than finds its original values in the text

Terms of use Contact us. We are very interested in ideas for new tools and bugs in existing. About us