Publication

December 2, 2014

Semiotic Indexing of Digital Resources

A method of classifying a plurality of documents. The method includes steps of providing a first set of classification terms and a second set of classification terms, the second set of classification terms being different from the first set of classification terms; generating a first frequency array of a number of occurrences of each term from the first set of classification terms in each document; generating a second frequency array of a number of occurrences of each term from the second set of classification terms in each document; generating a first similarity matrix from the first frequency array; generating a second similarity matrix from the second frequency array; determining an entrywise combination of the first similarity matrix and the second similarity matrix; and clustering the plurality of documents based on the result of the entrywise combination.

Parker, C.T. and Garrity, G.M. Semiotic Indexing of Digital Resources; 2014. United States Patent and Trademark Office.

U.S. Patent Grant No. 8,903,825 (4.6MB PDF)

[permalink] Posted December 2, 2014.

Back to top