
Asahi Kanji application for Android™
Single kanji Optical Character Recognition (OCR)
For a presentation of the whole application please see:
http://www.japanese-kanji.com/android/english/asahikanji-en-android.htm
This feature is obsolete and has been removed from the Asahi Kanji app in September 2025.
This was an experimental addition to the app, available between 2013 and 2025.
History
In this app OCR relied on Tesseract, an optical character recognition engine developed by Hewlett-Packard as proprietary software in the 1980s, released as open source in 2005 and further developed with the help of Google since 2006.
In 2017 Google introduced Google Lens, which is now much more efficient.
This is what we advise you to use now.
The old Tesseract files were too large (13.7 and 30.0 MB) to be distributed inside the APK and had to be downloaded separately. In order to do that , the app had to require several permissions like:
android.permission.INTERNET (to download the files)
android.permission.ACCESS_NETWORK_STATE (is the device connected to the web?)
android.permission.WRITE_EXTERNAL_STORAGE (to copy the downloaded files onto the device)
android.permission.CAMERA (for optical recognition)
To the user these permissions sounded too frightening "Do you want Asahi Kanji to take photos and videos?"
The app did neither. It just had to see the kanji to have it analyzed by Tessaract. No photos, no videos were taken.
These permissions also sounded suspicious to Google.
Here is the warning in my developer console:
Android 15 (September 2024) introduced even stricter requirements concerning permissions and native C librairies (the code used for the OCR functionality).
"Any existing app updates that don't support the new requirement will be blocked from publishing."
The following is kept as an archive:
How to use:
Limitations
Using this OCR feature is like using a cheap digital camera which can take excellent pictures when the conditions are right,
but takes poor shots in non standard situations.
You have to learn how to get the best of it.
Do not give up too quickly.
Beside the size and clarity of the target kanji, the type of font seems to be a major factor for a successful analysis.
Regular, standard (that is “skinny”) fonts give an excellent recognition rate, close to a 100%.
With bold fonts the recognition rate drops down. Forget about fancy calligraphy but "neat",
scholarly calligraphic glyphs like those used in
Henshall's "A Guide to Remembering Japanese Characters" are perfectly recognized.
The other activities like the kanji drills rely on Android OS only and can be thoroughly tested.
Unfortunately, it is not the case with the OCR function because it uses a camera whose capabilities
and specifications vary greatly depending on the device model.
Google keeps track of the number of devices on the market that should be compatible with each application, according to its requirements (like a camera, a touch screen, etc.)
On September 16, 2013 (Asahi Kanji version 1.4.0 release date) 3553 devices were listed.
Cameras can be mounted differently on smartphones and tablets,
their "natural orientation" varies and what you see on the screen is generally not what the camera sensor sees.
Therefore, the kanji image could be sent sideways or upside down to the OCR engine without you noticing it, resulting in a failed recognition.
Some camera models are notorious for certain bugs, fortunately apparently non fatal (they do not crash the app).
Your feedback, whether positive or negative, will be essential for the future development of this function.
The OCR function was developed and tested on a Google Galaxy Nexus, a standard device recommended for Android software development.
What is the use of OCR limited to a single kanji?
The Asahi Kanji app was designed as a tool to help people review or memorize Japanese kanji.
For those who are lucky enough to study in Japan, the OCR function will be helpful.
Kanji are everywhere around us in the streets, trains, shops, on signboards and posters...
More than often a kanji that we “almost know” or “definitely knew but have forgotten” catches our eye.
We just need a “booster shot” at the right moment, in the right environment, for the character to stick in our mind.
This app tries to give you this possibility.
Credits:
The OCR module uses the Tesseract OCR engine, released under the Apache License, Version 2.0
The OCR module user interface is based on an open source project developed by
Robert Theis
Permissions:
Asahi Kanji does not collect nor transmit any information.
Up to version 1.3 this application did not require any particular permissions to run.
This Optical Character Recognition module (OCR) was introduced in version 1.4.0 and several permissions are now required.
When you downloaded an update or the app for the first time, Google Play used to display the following dialog box.
Today's messages will look different, as the Google Play interface keeps evolving all the time.
The wording like "full access", "view", modify or delete", "take pictures and videos" can understandably be frightening,
although these actions are now standard on smartphones.
Network
"full network access"
Network access is needed by the app in order to download two large files from googlecode.com
These files are the data needed by the Tesseract OCR engine to recognize Japanese characters.
"view network connections"
When you decide to download the files mentioned above,
the app has to check if you have Internet access over wifi in order to display a message when it is not the case.
Storage
"modify or delete the contents of your USB storage"
The app needs to be able to copy the downloaded tarred (zipped) files to your USB storage and uncompress them.
If the download process fails or is cancelled by the user, the app needs to delete the partially downloaded files before the next try.
Camera
"Take pictures and videos"
The Optical Character Recognition module needs access to your camera.
It does not take pictures nor videos though.
It just extracts the kanji image from the camera preview before sending it to the OCR engine.
As for Asahi Kanji JLPT-N5, it is a free demo that does not include OCR, does not require license checks nor any permissions to run.
It is also ad-free (no banners).
For further details send a message to support@japanese-kanji.com