iOS:Image to string conversion - ios

I want to develop a feature, wherein user can take a picture of a serial number which has printed in a product or any things, using native camera and then scan the picture to retrieve the serial number as a string in my iOS program. I am not getting how to achieve this. Could someone please guide me to solve this problem? I don't want to know the name of the product and all after taking the picture, i want to get the string of the serial number taken from the photo.
Thanks in advance!

You need OCR (Optical Character Recognition), and that is not something to take lightly..
Take a look at Tesseract for iOs, this is a library that does this.. But be ready to dive into unknown realms.

Related

ABBYY OCR library recognition only number IOS?

I am creating number recognization in ios on real time. I have found one ABBYY OCR library This is very good and fast but this is recognization both text and number but I want to only number recognization. Please help me. how can I do this?. From StackOverflow I have found one hint we can use English.edc file from dictionaries But this file is lock and can't able to open. What is the solution? please share your valuable knowledge with me .

What is the way to parse a string of a well known format from an image on iOS (some library created specifically for this purpose)?

Local travel cards in Saint-Petersburg, Russia have got huge id numbers that aren't easy to read and type into a web page when topping up the card online. So I want to build a small app that would take a photo of a travel card and parse the number out.
The task is a bit easier than a free form recognition:
card is of the very well known size
id numbers are of known size, are located in the very well known location on a card and they are number only, no letters (okay, there are two variations I think and maybe they will add 1-2 more in the future)
even the font is known in advance
even the first several numbers are the same for most of the card (so far there are only two prefixes used)
How would you do it? Are there any libraries tuned not for the general OCR, but for a "hinted" OCR like I need?
Best regards,
Artem.
P.S.
Actually a free/cheap web service for this task would also be good enough
Yes Google has a library called Tesseract and there is an iOS SDK on Github you can import into your application. So you can use this SDK and it has some documentation that you can read that will explain how to set it up in your app. It has methods that will return you a string with the text of the card in the string. BUT it will be ALL of the text from the card. So best thing to do would be to:
1 "clip" the original image to extract a sub image that displays only the portion of the card you wish to get the numbers from.
2 Process this sub image through Tesseract to retrieve the string you are looking for.
3 Then parse through the string and pick out the data that you need.
But just be warned, it can be a bit quirky. This SDK tends to recognize words best from images that are scanned, not "taken a picture of". Because although it is an advance piece of technology, it isn't perfect. So to get it to work as perfectly as possible for you, try to get scanned copies of the originals.
Best of luck.
The ideal solution for you would have three components:
1) Detection of the card. This is useful because if you have the detection, then the end users have much easier time actually using the scanner, because they can place the phone above the card in an arbitrary direction
2) Accurate OCR component. Ideally, customizable for this exact font you have on the card, for the exact position on the card.
3) Parsing mechanism. This would enable you to obtain the exact string written on the card without writing huge amount of OCR parsing code.
BlinkID SDK has all this. It has a preset for detection cards in the ID-1 format. It has integrated OCR engine. And it provides RegexParser, where you can define the exact format of the text which you're trying to extract from the document.
BlinkID was initially built for scanning ID documents which have very similar properties as the problem you're trying to solve.
Note. I'm one of the developers working on BlinkID.

Image to Text Conversion - Rails 4/5

I cant seem to see anything in my searches.
I am trying to find a Rails 4 or 5 gem/plugin that acts similar to when you hold the camera to an app store card and it converts the text on the card to an object.
in my case i want a user to submit a photo and it reads two boxes in the screen shot / photo and converts the text in the image to text to save to an appropriate field and saves on from submit..
is there anything like this out there or am i just thinking a bit too far out of the box here..??
thanks.
Hope you're ready to invest a lot of time and probably money.
What you're looking for is OCR (Optical character recognition). ABBYY makes finereader which is a business solution for OCR. It's probably the best you can hope for at this point. There probably are quite a few open source solutions out there that will work more or less well.
Check out this project (or find similar ones): https://sourceforge.net/projects/tesseract-ocr/

How hard is it to build an iOS application like this?

I am a Web Designer and I don't know much about programming, well, I know literally nothing about iOS programming.
And I need a iOS app that can do these things for me:
It will take photos.
It will ask for some inputs from photo-taker like Name, E-Mail
etc.
The program will change the name of the taken photo with inputs
which program took in the first place, like name and e-mail. For
example, IPhone21313.JPG ---> Leo_Maxim__leomaxim#example.org.JPG
Then it will send the photo to a specific folder in a web server
like a FTP client.
Is it possible?
Even if it is possible, how hard is it to build some sort of a program like this one? Could any average iOS programmer accomplish to build this kinda program?
I know it was a long question, but I need your help, really.
Thank you!
Take a picture:
http://www.techotopia.com/index.php/An_Example_iOS_7_iPhone_Camera_Application
Get input:
http://code.tutsplus.com/tutorials/ios-5-sdk-uialertview-text-input-and-validation--mobile-8388
http://code.tutsplus.com/tutorials/ios-sdk-uitextfield-uitextfielddelegate--mobile-10943
Save the image:
http://iosdevelopertips.com/data-file-management/save-uiimage-object-as-a-png-or-jpeg-file.html
FTP Example:
https://github.com/nkreipke/FTPManager
Bottom line, it's extremely easy.

Use both GPS and Scan function in same channel with Junaio AR

As the title says, im simply wondering if it is possible to use the GPS tracking and POI part of junaio, and att the same time use the scan functionality to scan and recognize images. Im working with a group at a project which demands that we use both functionalities, and we are at the moment stuck on trying to send 2 XML documents, causing the server to return nothing at all. I simply want to know if it is possible to use both functionalities in the same channel, and I would greatly appriciate if someone would point me in a direction which could help me solve our problems, since I've been able to find absolutley nothing on my own. Thanks beforehand!
Scan + GPS/compass is not possible at the moment.
However, it's possible to use GPS/compass tracking and continuous visual search at the same time. This might be the closest thing to your requirements.
You might find more information on http://helpdesk.metaio.com

Resources