Indic W-shingling Library

What is W-shingling?

a w-shingling is a set of unique "shingles"—contiguous subsequences of tokens in a document—that can be used to gauge the similarity of two documents. The w denotes the number of tokens in each shingle in the set.

Read more about W-shingling

Supported Languages

English, Hindi, Malayalam, Kannada, Bengali

Enter the text for getting the w-shingling below. Language of each word will be detected. You can give the text in any language and even with mixed language.

W :


Python Wshingling API

This service provides indic w-shingling libraries
  • Method: modules.Shingling.wshingling
    • arg1 : the sentence
    • w : w of w-shingling (Optional)
    • Return : The w-shingling for the sentence