{"id":11955,"date":"2022-01-07T07:22:35","date_gmt":"2022-01-07T07:22:35","guid":{"rendered":"https:\/\/www.itwriting.com\/blog\/?p=11955"},"modified":"2022-01-07T07:22:35","modified_gmt":"2022-01-07T07:22:35","slug":"converting-a-scanned-image-to-text-in-office-365","status":"publish","type":"post","link":"https:\/\/www.itwriting.com\/blog\/11955-converting-a-scanned-image-to-text-in-office-365.html","title":{"rendered":"Converting a scanned image to text in Office 365"},"content":{"rendered":"<p>I was emailed an attachment scanned from a magazine; it was a nuisance and I wanted to convert it to text. There are of course a million ways to do this and I recall that every multifunction printer used to come with an OCR facility but what is the easiest way now? For a while I\u2019ve used Microsoft OneNote for this: you just paste in an image, right-click, and there is a Copy Text from Picture option:<\/p>\n<p><a href=\"https:\/\/www.itwriting.com\/blog\/wp-content\/uploads\/2022\/01\/image-1.png\"><img loading=\"lazy\" decoding=\"async\" width=\"233\" height=\"244\" title=\"image\" style=\"margin: 0px; display: inline; background-image: none;\" alt=\"image\" src=\"https:\/\/www.itwriting.com\/blog\/wp-content\/uploads\/2022\/01\/image_thumb-1.png\" border=\"0\"><\/a><\/p>\n<p>This normally works OK but not this time. The results were not completely useless but included lots of errors: words missing and words wrongly recognised or scrambled. I am not sure, for example, how the word \u201cscore\u201d got recognised as \u201cscMe\u201d. <\/p>\n<p>So I looked for a better solution online, trying to avoid ad-laden free OCR sites of unknown quality. I found <a href=\"https:\/\/convertio.co\/about\/\" target=\"_blank\" rel=\"noopener\">Convertio<\/a> which has a straightforward introductory service with no registration or ads for the first 10 pages. 
It did a much better job with only 3 or 4 errors, text converted correctly to two columns in a Word document, and a table converted to a Word table. The main issue was that the text was tiny \u2013 4pt \u2013 but that was reasonably easy to fix up. It seems that it has a much better recognition engine than OneNote.<\/p>\n<p>I\u2019ll be inclined to use Convertio again, but it also seems that Microsoft has got behind with this little corner of Office 365. Perhaps it should do something based on its <a href=\"https:\/\/docs.microsoft.com\/en-us\/azure\/cognitive-services\/computer-vision\/overview-ocr\" target=\"_blank\" rel=\"noopener\">Cognitive Services<\/a>.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>I was emailed an attachment scanned from a magazine; it was a nuisance and I wanted to convert it to text. There are of course a million ways to do this and I recall that every multifunction printer used to come with an OCR facility but what is the easiest way now? For a while &hellip; <a href=\"https:\/\/www.itwriting.com\/blog\/11955-converting-a-scanned-image-to-text-in-office-365.html\" class=\"more-link\">Continue reading <span class=\"screen-reader-text\">Converting a scanned image to text in Office 365<\/span> <span 
class=\"meta-nav\">&rarr;<\/span><\/a><\/p>\n","protected":false},"author":2,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[1519],"tags":[],"class_list":["post-11955","post","type-post","status-publish","format-standard","hentry","category-tech"],"_links":{"self":[{"href":"https:\/\/www.itwriting.com\/blog\/wp-json\/wp\/v2\/posts\/11955","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.itwriting.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.itwriting.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.itwriting.com\/blog\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/www.itwriting.com\/blog\/wp-json\/wp\/v2\/comments?post=11955"}],"version-history":[{"count":1,"href":"https:\/\/www.itwriting.com\/blog\/wp-json\/wp\/v2\/posts\/11955\/revisions"}],"predecessor-version":[{"id":11956,"href":"https:\/\/www.itwriting.com\/blog\/wp-json\/wp\/v2\/posts\/11955\/revisions\/11956"}],"wp:attachment":[{"href":"https:\/\/www.itwriting.com\/blog\/wp-json\/wp\/v2\/media?parent=11955"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.itwriting.com\/blog\/wp-json\/wp\/v2\/categories?post=11955"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.itwriting.com\/blog\/wp-json\/wp\/v2\/tags?post=11955"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}