I am working on speech recognition for Indian Accent. For better recognition, I want to create language model for Indian accent.
The tutorial I got describes about Linux OS. Is there any way to do in Windows for acoustic model adaptation ?? Is there any alternative other than recorded sound to create a acoustic model ??
I found online language model tool here >>http://www.speech.cs.cmu.edu/tools/lmtool.html
But it is US accent.
Is there any online tool to create Indian/UK accent language model ??
First a quick clarification. Creating a new language model will not help you with accents. The language model only specifies in which orders the words can appear. You are looking for acoustic, not language, model adaptation.
Because adaptation is a complex process, there are not good online tools for it (the one you linked is a language model tool). I would recommend downloading SphinxTrain at http://cmusphinx.sourceforge.net for a good all-purpose adaptation/training tool. There are many tutorials and forums to help you get started.
Related
I am currently looking for a programation language that allows me to create my own diagrams.
You can find attached the types of diagram I would like to create.
Speed variator block diagram
Antenna signal reception
Antenna transmission/reception
Algorithm
What language could you recommend me ?
I think LaTEX is a good language to do this right ? If so, could you recommend me courses to learn this language ?
Thank you in advance,
kind regards,
Marco
For now, I didn't tried anything except online diagram software, but those don't allow me to do what I would like...
I watched a YouTube video about voice cloning: https://www.youtube.com/watch?v=Kfr_FZof_hs
It's an interesting topic, but this project's repository only supports English: https://colab.research.google.com/drive/1NxiY3zHN4Nd8J3YAqFsbYaOB71IiLE04?usp=sharing#scrollTo=JrK20I32grP6
I want to adapt it for Italian.
I am a beginner in machine learning.
What do I need to do to get TTC to "learn" Italian?
Is it necessary to train the model on audio files or rebuild the model, or what needs to be done?
Can you advise me)
You can check the following where the creator answers this question.
Creator's answer:
Here is what you need to train this:
a wav2vec or similar asr model for your language
at least 10,000 hours of usable spoken language, with no environmental noises, music, etc. This does not need to be transcribed. I used audiobooks and podcasts for english.
approximately 16 months total of v100 time
https://github.com/neonbjb/tortoise-tts/issues/5
I have read the documentation for the Roboforth environment from STrobotics and recognized that this a nice way for programming a robot. What I missed is a sophisticated software library with predefined motion primitives. For example, for picking up a object, for regrasping or for changing a tool.
In other programming languages like Python or C++, a library is a convenient way for programming repetitive tasks and for storing expert knowledge into machine-readable files. Also a library is good way for not-so-talented programmers to get access on higher-level-functions. In my opinion Forth is the perfect language for implementing such an API, but I didn't find information about it. Where should I search? Are there any examples out there?
I am author of RoboForth, and you make a good point. I have approached the problem of starting off new users with videos on YouTube; see How to... (playlist with 6 items, e.g "ST Robotics How-to number 1 - getting started") which is a playlist covering basics and indeed tool changing.
I never wrote any starter programs, because the physical positions (coordinates) would be different from one user to the next, however I think it can be done, and I will do it. Thanks for the heads up.
I read a few papers about machine translation but did not understand them well.
The language models (in Google translate) use phonetics and machine learning as best as I can tell.
My question then becomes is it possible to convert an Arabic word that is phonetically spelled in English to translate the users intended Arabic word?
For instance the word 'Hadith' is an English phonetic of the Arabic word 'حديث'. Can I programmatically go from 'Hadith' to Arabic?
Thanks the Wiki article, there's an entire field of work in the area of Transliteration. There was a Google API for this that was deprecated in 2011 and moved to the Google Input Tools service.
The simplest answer is Buck Walter Transliteration but at first glace a 1:1 mapping doesn't seem like a good enough idea.
I am going to try to see if there's a way to hack the Google Input tools and call it even at CLI level because their online demo works very well
I am using stanford ner for removing the identity from the essays.
It is detecting the names like Werner..But indian names such as ram, shyam etc. goes undetected.
What i should do to make them recognizable.
You should train NER for Indian names. I could not find detailed information for how to achieve that. But this FAQ page ( http://nlp.stanford.edu/software/crf-faq.shtml#a ) has some information which may be a starting point for you. Especially the questions 2-3 are directly related to your question.