OwlOCR



OwlOCR offers simple optical character recognition of text in PDF files, images or on-screen and converts that to plain text. All conversion is done securely on-device - none of the data are sent to third-party services in the cloud as most applications do. Same problem domain as in action extensions, but didn't really see specific answer there. Also, I need to have a UI for my app, so removing UI (the one I provide) isn't really an option. OwlOCR offers simple optical character recognition of text in PDF files, images or on-screen and converts that to plain text. All conversion is done securely on-device - none of your images or files are sent to third-party services in the cloud as most applications do. MAU went down briefly after the crazy days in late 2020 where lots of people tried the app and for some of them it didn't stick of course. I turned lockdown into a side project and why you should too (owlocr.com) 2 points by frankbyte 89 days ago past Show HN: New website for a Mac screen text recognizer ( owlocr.com ).

  1. Owlcrate
  2. Owlcorn Hatchimal

Digitize images and PDFs to searchable PDFs using OCR right on your Mac.

4.7/5.0 average
OwlOCR
Works best on MacOS 11 Big Sur

How does it work?

OwlOCR
1. Press Cmd+F1 or select screen capture from the menubar.
3. Paste text (Cmd+V) anywhere.
Optical Character Recognition (OCR) is used to extract text from the screen area.
The processing is done right on your Mac for near instantenous results.

Privacy secured

Your screenshots and files never leave your machine.
The text recognition algorithm that is used has been developed by Apple and is a part of your MacOS operating system and is only run on your device.
We never ask for or require any of your personal information or your data.
OwlOCR is the only app for the Mac that will allow you to inputvia screenshots, image or PDF files or even using your iOS device camera.
OwlOCRAny inputs can be exported to searchable PDFs.
1. Drag and drop the non-searchable files into OwlOCR - or take screenshots / iOS scans.
OwlOCR
3. Exported PDF includes invisible text layers that can be searched by most PDF readers.
MacOS 10.15 Catalina or newer required

Core

  • Unlimited high quality text recognition on-device securely and incredibly fast
  • Grab a screen area for instant OCR and results copied to clipboard
  • Grab multiple areas of your screen for instant OCR and concatenated results copied to clipboard
  • Use Finder Extensions to trigger main use cases right from Finder
  • Drag and drop documents to the app to convert to text
  • OCR page-by-page in multi-page documents
  • Take a photo or scan documents with your iOS device, directly from the app
  • Export searchable PDFs
  • Export OCR results to text files
  • Custom dictionary
  • Line spacing and word wrapping options
  • Keyboard shortcut Cmd + F1 (configurable) for a screen grab
  • Keyboard shortcut Cmd + F2 (configurable) for multiple screen grabs
  • Notifications on OCR completion
  • Input filetypes supported: PDF, PNG, JPEG, GIF, HEIC
  • Language support under MacOS 11: English, French, Italian, German, Spanish, Portuguese, Traditional Chinese and Simplified Chinese.
  • Language support under MacOS 10.15: English.
  • Many customization options like Menu Bar -only mode, screenshot sounds, storage directories and more
  • Apple M1, Dark Mode and MacOS Big Sur support

Pro

  • All Core Features
  • Batch OCR multi-page PDFs or multiple images files in a single fast run, using all your CPU cores
  • Batch process multiple files in a single go right from Finder using Finder Extensions
  • Use multiple screenshot feature to capture up to 20 concatenated screenshots
  • Configure post-processing rules for Find/Replace behaviour and Smart Quotes substitution
  • Karma boost for supporting indie Mac apps
3 week Pro trial is automatically provided on installation.
No sign-up required.
  • < Previous
  • Next >

College of Arts and Letters Posters

Title

Presenter and Co-Authors

College

College of Arts & Letters

Department

OwlOCR

English

Program

Applied Linguistics

Publication Date

4-2021

DOI

0000-0002-6942-6689 (McCullough)

Abstract

Ladino (or Judeo-Spanish) is a Diasporic Jewish language spoken by Sephardi Jews. There is little existing scholarly research on Ladino, nor does it have many language learning materials. These two factors compelled me to create the Aki Yerushalayim Corpus. The initial Aki Yerushalayim Corpus of Modern Written Ladino (currently ~7,000 words) was not created to act as a reference corpus of Modern Ladino. Rather, it was created to study the composition of Ladino prose and demonstrate the utility of this type project in the subdiscipline of language documentation. In addition, the project’s focus on cultural essays and narrative prose allow for insights into how Ladino writers construct their identities through word choice.

The research questions that directed the creation of the corpus are as follows: 1. What substrata feature most prominently in the Ladino lexicon? 2. Do the borrowed words belong to a particular semantic domain? 3. What parts of speech are most/least frequently borrowed?

To address and answer these questions, the gathered texts were first scanned with an optical character recognition software (OwlOCR) before being proofread for any mistakes. They were then tagged for Part of Speech and for language of origin and run through a concordancer (AntConc).

From this brief glimpse at Ladino prose in the late 20th and early 21st centuries, the results are as expected: the language is Spanish-based, and the most frequently-used substrata are Biblical Hebrew and Turkish, primarily used for religious and secular/cultural words, respectively. This pilot gives a snapshot into how authors represent their identity as Sephardi Jews in both cultural essays and narrative prose, and further study of the distribution of these loanwords will allow linguists to understand how language contact and a speech community’s history can provide context that illuminates which loanwords are most frequently adopted and why that may be.

Disciplines

Applied Linguistics | Jewish Studies

Files

Recommended Citation

McCullough, Rachel, 'The Aki Yerushalayim Corpus: A Study of Loanwords in Ladino' (2021). College of Arts and Letters Posters. 5.
https://digitalcommons.odu.edu/gradposters2021_artsletters/5


DOWNLOADS

Since April 01, 2021

Included in

Owlcrate

Applied Linguistics Commons, Jewish Studies Commons

Owlcorn Hatchimal

COinS