<?xml version="1.0" encoding="utf-8" ?>
<rss version="2.0" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:wfw="http://wellformedweb.org/CommentAPI/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:sy="http://purl.org/rss/1.0/modules/syndication/" xmlns:slash="http://purl.org/rss/1.0/modules/slash/">
  <channel>
    <title>#tesseract posts — Ben Crowder</title>
    <link>https://bencrowder.net/blog/tag/tesseract/</link>
    <atom:link href="https://bencrowder.net/blog/tag/tesseract/feed/" rel="self" />
    <description>Feed for blog posts tagged with #tesseract.</description>
    <lastBuildDate>Sat, 04 Apr 2026 05:22:16 GMT</lastBuildDate>
    <language>en-US</language>
    <generator>https://bencrowder.net/</generator>

    <item>
      <title>Ancient Greek OCR</title>
      <link>https://bencrowder.net/blog/2014/ancient-greek-ocr/</link>
      <guid isPermaLink="true">https://bencrowder.net/blog/2014/ancient-greek-ocr/</guid>
      <pubDate>Tue, 04 Nov 2014 12:00:00 GMT</pubDate>
      <dc:creator><![CDATA[Ben Crowder]]></dc:creator>
      <description><![CDATA[<p><a href="http://ancientgreekocr.org/">This</a> is cool:</p>
<blockquote>
  <p>Ancient Greek OCR is free software to accurately convert scans of printed Ancient Greek into unicode text and PDF files, which can be easily searched, copied, archived, and transformed. It uses the excellent Tesseract OCR engine, tailored for Ancient Greek typography, syntax and vocabulary.</p>
</blockquote>
<p>I haven’t used Tesseract in 10+ years, but back then it wasn’t too great. According to <a href="https://github.com/tesseract-ocr/tesseract">their website</a>, however: “Between 1995 and 2006 it had little work done on it, but since then it has been improved extensively by Google.” That’s encouraging. (I wonder if that’s what they’re using behind the scenes for Google Books and Google Drive and their other things.)</p><hr class="feed-extra" style="margin-top: 48pt;" /><p class="feed-extra feed-mail"><a href="mailto:ben.crowder@gmail.com?subject=Re%3A%20Ancient Greek OCR">Reply via email</a></p>]]></description>
    </item>
    
  </channel>
</rss>
