Publication date: 21-04-2021
Number: GB0002588251A
Assignee:
An image of an invoice (e.g. a rasterised document or PNG image, which may contain a logo in the header) is split into regions (302; see the top, middle and bottom parts 102, 103 and 104 of the invoice, fig. 1). A perceptual image hash is calculated for each region of the invoice image (303). Each hash is then looked up (304) in a database of entries for which the vendor name is known: the Hamming distance between the image hash and each entry in the database of known vendors is calculated, and the entry with the smallest Hamming distance is taken. In this way, the database is searched for a match and the vendor associated with the invoice is identified. The perceptual image hash may be calculated using an average hash, a difference hash or a pHash algorithm. The invoice image may be reduced to greyscale and to an 8x8 grid of pixels. If the sum of the best two Hamming distances of the hashes of two of the invoice regions is greater than a threshold, then the invoice 101 is not considered to match the database of known ...
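The matching steps described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: it assumes the average-hash variant, represents a greyscale image as a plain list of pixel rows, and assumes that each database entry stores one hash per region (top, middle, bottom). The function names (`shrink`, `average_hash`, `split_regions`, `identify_vendor`), the per-vendor scoring rule and the threshold value are all hypothetical choices made for illustration.

```python
from typing import Dict, List, Optional, Sequence

Gray = Sequence[Sequence[float]]  # rows of greyscale pixel values (assumed input format)


def shrink(img: Gray, size: int = 8) -> List[List[float]]:
    """Box-average a greyscale image down to a size x size grid."""
    h, w = len(img), len(img[0])
    out = []
    for r in range(size):
        r0 = r * h // size
        r1 = max((r + 1) * h // size, r0 + 1)  # at least one source row per cell
        row = []
        for c in range(size):
            c0 = c * w // size
            c1 = max((c + 1) * w // size, c0 + 1)  # at least one source column
            cells = [img[y][x] for y in range(r0, r1) for x in range(c0, c1)]
            row.append(sum(cells) / len(cells))
        out.append(row)
    return out


def average_hash(img: Gray, size: int = 8) -> int:
    """Average hash: one bit per grid cell, set when the cell is brighter than the mean."""
    small = [v for row in shrink(img, size) for v in row]
    mean = sum(small) / len(small)
    bits = 0
    for v in small:
        bits = (bits << 1) | (1 if v > mean else 0)
    return bits


def hamming(a: int, b: int) -> int:
    """Number of bit positions in which two hashes differ."""
    return bin(a ^ b).count("1")


def split_regions(img: Gray, parts: int = 3) -> List[Gray]:
    """Split the image into horizontal bands (top, middle, bottom for parts=3)."""
    h = len(img)
    return [img[i * h // parts:(i + 1) * h // parts] for i in range(parts)]


def identify_vendor(img: Gray, known: Dict[str, List[int]],
                    threshold: int) -> Optional[str]:
    """Return the closest known vendor, or None if no entry is close enough.

    Assumption: a vendor's score is the sum of its two smallest
    region-wise Hamming distances; the lowest score wins, and a score
    above the threshold means "no match".
    """
    region_hashes = [average_hash(region) for region in split_regions(img)]
    best_vendor, best_score = None, None
    for vendor, stored in known.items():
        dists = sorted(hamming(h, s) for h, s in zip(region_hashes, stored))
        score = dists[0] + dists[1]
        if best_score is None or score < best_score:
            best_vendor, best_score = vendor, score
    if best_score is None or best_score > threshold:
        return None
    return best_vendor
```

In practice the database would be populated by hashing the regions of invoices whose vendor is already known, and the threshold would be tuned on real data. A difference hash or pHash could be substituted for `average_hash` without changing the lookup logic.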