iOS - Speech to text, dictation - ios

I have to develop an app which allows user to speech text, but there are some APIs in the iOS SDK which allow that? Or even third part libs? I have only one problem : the app is in italian language, so a third part lib has to support italian language.

As of iOS 10 (which isn't released at the time of writing), there is a new speech recognition API. It supports over 50 languages and dialects.
If you are looking for a solution, I'd strongly recommend this over a third party library... Apple will keep it up to date, support it, and improve it. You have none of those guarantees with a third party library.
https://developer.apple.com/reference/speech/sfspeechrecognizer

Any UITextField or UITextArea can be dictated using the built-in dictation feature (in devices that support it of course).
An API is not available and you cannot use Siri directly for that.

As of this post there is no way of doing this using native iOS SDK. However, you have the IBM Bluemix Speech-to-Text service available to you for free:
http://www.ibm.com/smarterplanet/us/en/ibmwatson/developercloud/speech-to-text.html

Related

What API/SDK to print automatically to thermal printer (e.g. Brother QL) with my React Native iPhone app without interaction?

I've been looking into different API/SDKs that would be best and easiest to integrate with my React Native iPhone app. This is an internal employee app, that once a form is submitted I would like to automatically print out a label from a thermal printer (e.g. Brother QL1110NWB).
Some options I found are:
React Native:
https://github.com/christopherdro/react-native-print
Brother SDK:
https://support.brother.com/g/s/es/htmldoc/mobilesdk/
Apple AirPrint:
https://developer.apple.com/documentation/uikit/uiprintinteractioncontroller#//apple_ref/doc/uid/TP40010141-CH1-SW34
Does anyone have experience with any of these API/SDKs and can recommend best method for easy integration and be able to print directly without interaction?
Thank you!
If you can dictate the hardware to make sure it is compatible, then the Brother SDK will have the easiest integration and be transparent to the user.
Apple restricts Bluetooth Classic devices from connecting to iOS unless they are MFi certified, which adds to the price but enables the External Accessory Framework at the OS level. Device manufacturers can then sell a printer that is 'iOS certified' like the Brother QL1110NWB and provide the SDK to use it.
How to use bluetooth classic instead of le
You would need to write a React Native binding for each target device family to use the SDK's OS specific libraries. Brother has an iOS, Android and Xamarin SDK available, but I don't see one for React, so you would need to wrap their iOS SDK with hooks to invoke your print on demand. Before you start, contact Brother support or your sales rep, because they may be working on React Native SDK, or your organization may have enough buying power to request one. Either way, it's a good practice to make this an interface at the base React level so that new printers or platforms can be added just by implementing custom native device adapters.

Is speech to text functionality and voice analysis possible in ios 8 and above

I need to integrate this functionality on the application I am currently working on.
Though I came to know about OpenEars but do we have support for iOS 8 and 9.
Please do suggest any other latest and reliable apis for voice recording and analysis.
Yes. OpenEars support the new iOS8 and iOS9. OpenEars just need AudioToolbox and AVFoundation framework to be added.
There are other API's
Free/OpenSource:
http://www.ispeech.org/
http://nuancemobiledeveloper.com/public/index.php
Paid:
http://www.creaceed.com/ceedvocal/about
Speech to text is supported natively with the class AVSpeechSynthesizer, available since iOS7. AVSpeechSynthesizer is a "free" TTS system integrated iOS: unless you really need specialized voices, e.g. child voices, this is your best option for speech synthesis.
The only "non online" speech recognition SDKs which I am aware of is CMUSphinx written in C and which supports very few speaker languages. You will probably need to adapt it yourself for iOS.
There are several other APIs, available as paid services. Your mileage with API based assistive speech services will vary.

Using twilio sdk for an iOS app?

I am planing to develop an VoIP iOS app and use Twilios SDK. I am making the choice to either use LiveCode, Appery.io, PhoneGap or build a native Objective C app. I am going to build the app for iOS, Android and HTML5 so the ideal would be to develope in JavaScript for all platforms, but as I understand the support for WebRTC is laking on the iPhone so the alternative for iOS is the native twilio SDK.
My requirements is:
be possilbe to use in iPhone 5 with iOS 7 be able to use twilio iOS
SDK´s voip functionality or twilio´s js SDK (if it is possible to
wrap a browser that supports RTC in the code?) be able to integrate
billing such as in-app payment or paypal with zooz or similar
communicate with REST API´s such as Amazon S3 or a node.js server
store temporary info in a SQLLite db when app is off line make fast
and responsive views (file listings etc) is very important
create cfuuid´s
I have seen several Twilio projects that use PhoneGap but none that are using LiveCode.
I have already built an iOS VoIP app in Objective C, but I want to be able to release it on several platforms also such as for Android and build a HTML5 app, without redoing everything.
This isn't really a programming question and should perhaps not be asked here.
You can create an external for LiveCode and quickly create an interface using the LiveCode IDE. This is probably a quick and easy way to make a working app. If you're starting with LiveCode but are experienced in Objective-C, creating an external won't be a problem for you.
LiveCode doesn't contain native iOS controls, which means that you have to emulate the GUI. If you use PhoneGap, you also will need to compile a plugin for PhoneGap using Objective-C, but you can use a framework, such as JQuery, to get the right GUI.
Either way, you will have to compile the SDK and you'll need to be quite profound in Objective-C.
LiveCode will meet all your requirements. However, Apple will deny your app if you use PayPal for in-app purchases. You'll have to use Apple's in-app purchasing feature. I believe this is possible in LiveCode now. I'm not sure how easy it is.
I'm not sure about file listings either. On iOS, you won't have complete access to all files on the phone. This isn't a LiveCode limation but a limitation of the OS.

Adding text to speech functionality to iOS app

I'd like to implement such functionality in my app, and AFAIK, the only choice is to use a third-party library, so I've been looking for possible options. I've found that OpenEars seems to be one of the best free and open-source libraries for that purpose, and it works offline, but it only supports English language and I'd want to offer multilanguage text to speech, at least for English, Portuguese, Spanish and French.
I've read that, for multilanguage support, Google TTS performs well, but it only works online... and I'm not sure if a limit of calls exists, does it? Where could I find detailed info about using Google TTS in iOS?
And finally, I've also read a lot of references to Flite or Festival Lite... but it doesn't seem to support multilanguage either. Does somebody know a free, open-source TTS library for iOS that supports several languages and, preferably, works offline?
Just another question. The app I'm working on may have a commercial version and it is intended to be submitted to the Apple Store. Could the use of any TTS third-party library be a problem?
Thanks a lot
iOS 7 has AVSpeechSynthesizer that can do text-to-speech out-of-the-box without the need of any external frameworks.
For more on this, visit this link.
There is two API's which supports these languages:
ispeech
acapela
But both are online and paid.
OpenEars is a offline text-to-speech and speech-to-text opensource library. But I don't know whether it supports other languages other than English.
Could the use of any TTS third-party library be a problem?
Answer: No, there is no problem with this.
Update (on 27 Aug 2013):
OpenEars now supports Spanish as well as English. (Check the below comment from Halle)

How to implement text to speech functionality in iPad?

Is there any 3rd party API which will provide text to speech facility??
Yes.
The CMU Flite (Festival Lite) speech synthesis library has been ported to iOS and appears to be used in many iPhone and iPad apps in the App store.
Yes. Follow this link for your reference. You can use this as you require. There is iPhone sample in there, but it is easy to convert it to iPad.
And there is also other way with google api using following way.
http://translate.google.com/translate_tts?tl=en&q=Yourtext. But it has certain limitation like 100 chars and also you have to look for commercial usage conditions.
This you can load in UIWebView passing your text.
Hope it helps.

Resources