I am building an English dictionary app and using OpenEars TTS for pronunciation, but the voice quality is poor. Any suggestions for improving the sound quality?
If you only need to support iOS 7 and above, consider using AVSpeechSynthesizer.
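As a rough illustration of the AVSpeechSynthesizer suggestion, here is a minimal sketch. The `Pronouncer` class and its method names are hypothetical, and the "en-US" language code and default rate are only illustrative choices; the AVFoundation calls themselves are the standard ones.

```swift
import AVFoundation

/// Hypothetical helper that speaks dictionary entries with the system TTS.
final class Pronouncer {
    // Hold the synthesizer in a property so it stays alive while speaking.
    private let synthesizer = AVSpeechSynthesizer()

    /// Builds an utterance for `word`. The language code is an assumed default.
    func makeUtterance(for word: String, language: String = "en-US") -> AVSpeechUtterance {
        let utterance = AVSpeechUtterance(string: word)
        // Returns nil for an unknown language code; a nil voice means the system default.
        utterance.voice = AVSpeechSynthesisVoice(language: language)
        utterance.rate = AVSpeechUtteranceDefaultSpeechRate
        return utterance
    }

    /// Queues the word for speech.
    func pronounce(_ word: String) {
        synthesizer.speak(makeUtterance(for: word))
    }
}
```

In an app you would keep one `Pronouncer` around (for example as a property of a view controller) and call `pronounce("ephemeral")` when the user taps a word.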
You can contact Politepix, the makers of OpenEars, directly and describe your problem. Here is the link to their contact page: http://www.politepix.com/contact/
The best place to get your question answered is their forums: http://www.politepix.com/forums/forum/openearsforum/
They also offer a paid private support service: http://www.politepix.com/shop/openears-support-incident/
Related
Would I be able to change the speaking voice of OpenEars to another one? I don't quite like the default one. Is that possible, or would I have to use another API? Sorry if this is a stupid question.
Yes, you can, but it is a somewhat complex process. OpenEars uses Flite for speech synthesis, so you need to change the voice in Flite. Flite already supports 13 voices to choose from, and you also have the option of building a new voice.
To build a new voice you need to follow the documentation. The Festvox documentation may also be useful for understanding the basics.
I've been researching several iOS speech recognition frameworks and have found it hard to accomplish something I would think is pretty straightforward.
I have an app that allows people to record their voices. After a recording is made, they have the option to create a text version.
Looking into the services out there (e.g., Nuance), most require live input from the microphone. OpenEars can work from a recording, but its vocabulary is very limited because it is an offline solution (they recommend 300 words or fewer).
There are a few other things going on with the app that would make it very unappealing to switch from the current recording method. For what it is worth, I am using the Amazing Audio Engine framework.
Does anyone have any other suggestions for frameworks? Or is there a way to dig deeper with Nuance to transcribe a recorded file?
Thank you for your time.
For services, there are a few cloud-based hosted speech recognition services you can use. You simply post the audio file to their URL and receive the text back. Most of them don't place any constraint on the vocabulary. You can of course choose any recording method you like.
See here: Server-side Voice Recognition. Many of them offer a free trial as well.
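The "post the audio file and receive the text back" flow could look roughly like the sketch below. Everything service-specific here is an assumption: the endpoint URL, the `audio/wav` content type, and the idea that the response body is plain UTF-8 text are placeholders, and a real service will define its own authentication and response format. Only the URLSession calls are standard.

```swift
import Foundation

/// Builds a POST request for raw audio. Endpoint and content type are placeholders.
func makeTranscriptionRequest(endpoint: URL, contentType: String = "audio/wav") -> URLRequest {
    var request = URLRequest(url: endpoint)
    request.httpMethod = "POST"
    request.setValue(contentType, forHTTPHeaderField: "Content-Type")
    return request
}

/// Uploads a recorded file and hands back the response body as text.
/// Assumes (hypothetically) that the service replies with plain UTF-8 text.
func transcribe(fileAt fileURL: URL,
                endpoint: URL,
                completion: @escaping (Result<String, Error>) -> Void) {
    let request = makeTranscriptionRequest(endpoint: endpoint)
    let task = URLSession.shared.uploadTask(with: request, fromFile: fileURL) { data, _, error in
        if let error = error {
            completion(.failure(error))
            return
        }
        // An empty or non-UTF-8 body is reported as an empty string.
        let text = String(data: data ?? Data(), encoding: .utf8) ?? ""
        completion(.success(text))
    }
    task.resume()
}
```

Because the upload takes a file URL, it works with whatever recording method produced the file, which is the point made above.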
I am creating an iOS game in which I have to tell the user about game events by voice, for example "you have moved one piece", "you have moved 2 pieces", or "well done, you have performed well".
The problem is that there are a large number of these messages, and if I bundle an audio file for each one, the app size will grow very large.
The second option I have found is to use a text-to-speech library. I have tried "OpenEars", but I want a voice like a cartoon character or a bird, which as far as I can tell is not available in any open-source text-to-speech library.
Can anybody suggest a better way to handle this, or a text-to-speech framework that offers voices like the ones described above?
Thanks in advance.
VoiceForge offers different TTS voices.
http://www.voiceforge.com
What frameworks are required to detect how loud someone is talking into a microphone? Also, can anyone tell me what to search for in the documentation or on Google so I can write some code? What calls would commonly be used to detect the volume level of sound coming through the microphone? Thanks!
This seems to be a duplicate of:
Realtime microphone sound level monitoring
However, that question is old and the accepted answer links to a deprecated library. They now recommend that you instead use AVAudioRecorder. They suggest this tutorial and it seems to be what you are looking for.
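For reference, level monitoring with AVAudioRecorder can be sketched as follows. This is not the code from the linked tutorial: the `MicrophoneMeter` class, the `linearLevel(fromDecibels:)` helper, the temporary file name, and the recording settings are all illustrative assumptions. Audio session setup and the microphone permission prompt are omitted.

```swift
import AVFoundation

/// Converts a decibel reading (0 dB is full scale, about -160 dB is silence)
/// into a linear amplitude between 0 and 1.
func linearLevel(fromDecibels decibels: Float) -> Float {
    let clamped = min(max(decibels, -160), 0)
    return pow(10, clamped / 20)
}

/// Hypothetical meter that records to a throwaway file just to read input levels.
final class MicrophoneMeter {
    private let recorder: AVAudioRecorder

    init() throws {
        // The file name and settings below are illustrative, not required values.
        let url = URL(fileURLWithPath: NSTemporaryDirectory())
            .appendingPathComponent("meter.caf")
        let settings: [String: Any] = [
            AVFormatIDKey: Int(kAudioFormatAppleLossless),
            AVSampleRateKey: 44_100.0,
            AVNumberOfChannelsKey: 1
        ]
        recorder = try AVAudioRecorder(url: url, settings: settings)
        recorder.isMeteringEnabled = true
    }

    /// Starts recording; returns false if recording could not begin.
    @discardableResult
    func start() -> Bool {
        return recorder.record()
    }

    /// Refreshes the meters and returns the average level of channel 0 as 0...1.
    func currentLevel() -> Float {
        recorder.updateMeters()
        return linearLevel(fromDecibels: recorder.averagePower(forChannel: 0))
    }
}
```

A typical use is to call `start()` once and then poll `currentLevel()` from a timer, for example a few times per second, to drive a level indicator.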
I'm having trouble locating BlackBerry technical specifications, and their website is a mess. They also don't have a number that I can use to easily contact them.
This isn't exactly a coding question, but what does the BlackBerry audio API look like, and where can I get technical specifications on audio? Specifically, I'm trying to find out more about audio input through the Mic-In on the 3.5 mm jack.
Unfortunately, before I can proceed, I need to know things such as sampling rate, data width, etc.
A pointer to the right resource, or an answer off the top of your head, would be appreciated.
I couldn't agree more. The BlackBerry site is a maze!
You can find the API here.
I'm afraid I am not sure about the audio side of BlackBerry, but check out the BlackBerry support forum, as some people who work for RIM are on there.
How about this post?
I am sorry I can't help more.