Thursdays the PTO publishes new pending patent applications. Both of today's Spotlight Applications concern aspects of data fingerprinting. Assigned to Cortica , Ltd. (IL), the first application discloses techniques for generating signatures for multimedia deep-content-classification by a large-scale matching system. Assigned to Workshare Technology, the second of today's applications discloses techniques for fingerprinting textual information using word runs.
20130080433, "Signature generation for multimedia deep-content-classification by a large-scale matching system and method thereof," assigned to Cortica, Ltd. (IL).
Content-based clustering, recognition, classification and search of high volumes of multimedia data in real-time. The embodiments disclosed herein are dedicated to real-time fast generation of signatures to high-volume of multimedia content-segments, based on relevant audio and visual signals, and to scalable matching of signatures of high-volume database of content-segments' signatures. The embodiments disclosed herein can be implemented in any applications which involve large-scale content-based clustering, recognition and classification of multimedia data, such as, content-tracking, video filtering, multimedia taxonomy generation, video fingerprinting, speech-to-text, audio classification, object recognition, video search and any other application requiring content-based signatures generation and matching for large content volumes such as, web and other large-scale databases.
20130074198, "Methods and systems to fingerprint textual information using word runs," apparently assigned to Workshare Technology, Inc..
The present invention provides methods and systems to enable fast, efficient, and scalable means for fingerprinting textual information using word runs. The present system receives textual information and provides algorithms to convert the information into representative fingerprints. In one embodiment, the fingerprints are recorded in a repository to maintain a database of an organization's secure data. In another embodiment, textual information entered by a user is verified against the repository of fingerprints to prevent unauthorized disclosure of secure data. This invention provides approaches to allow derivative works (e.g., different ordering of words, substitution of words with synonyms, etc.) of the original information to be detected at the sentence level or even at the paragraph level. This invention also provides methods and systems for enhancing storage and resource efficiencies by providing approaches to optimize the number of fingerprints generated for the textual information.