I am a bit confused about this topic. I have done a little object detection for the video.
Should I summarize according to the objects detected in the video or should I extract key frames from the video that give a good idea about the content ?
I did search for this on the internet ... I found this
But still I want to know how should I proceed ?
Thanks!
There is no answer to that question, this is an open research topic! Pr. Bernard Merialdo from France has been studying this topic for several years. You can have a look at his research group page and publications.
Related
I understand so far that AKSampler was recently rewritten and this GitHub project seems to be the defacto guide on the new AKSampler. What I can gather is a move toward SFZ format. I am new to the sampling world but in my application I only need a handful of samples recorded from my piano in order for it to work. As I have looked around with existing SFZ formats and samples, I do not need all of the complexity and features that SFZ provides.
I am currently using AKSampler with a single piano sample which works perfectly, however it gets a bit weird once I play anything too far from the original source, so I just want to fill in the gaps with a few other samples (I only need to play around an octave and a half with my current app).
I do see according to the Docs a couple methods buildSimpleKeyMap() and buildKeyMap() however there is no implementation currently.
Do I have any additional options? I know that EXS format has been deprecated, as well as SoundFont. Is the only way to map multiple samples to AKSampler currently using SFZ?
Thanks for all your help <3
Edit: This readme on the AKSampler GitHub page provides the breakdown for samples. I still only see SFZ being considered. If anyone else is lost with my question or needs a reference, this seems to be the best resource. If the current AKSampler only offers SFZ as the primary way to map multiple samples, so be it, however it does look very challenging, I'm really hoping there is some simple middle ground between only using a single sample for the AKSampler vs. a full bore SFZ file.
Edit 2: Getting a solution to this, will update as soon as possible, thanks for your patience!
I have provided a simple explainer and sample file in the AudioKit docs. Hope this helps new users of AudioKit!
I'm using Apple's Core ML to visually recognize items in an image but it's returning incorrect answers sometimes identifying shoes as a knife etc. Is there a way to provide feedback about CoreML and hopefully guide it towards correctly identifying the items in an image?
You're probably giving the Core ML model inputs that it does not expect. I wrote a blog post about the most common mistakes: http://machinethink.net/blog/help-core-ml-gives-wrong-output/
I would open a feedback ticket at https://developer.apple.com/bug-reporting/
Apple is really glad to get devs feedback. Try to detail yours as deeply as possible :)
EDIT : I would also suggest that you try another CoreML model ! I had a few tries with Inception V3 which worked like a charm with my apps. https://developer.apple.com/machine-learning/
The problem is that i want to compare two sounds together and check how much similarity is in between two Voice/Sounds.
Example : There is "A" pronunciation already stored in app and the user say/record "A" in his/her voice and then we compare both and give result how much % they are same.
I searched GitHub and stack overflow for answer but didn't got any authentic and proper solution for this.
Can anybody share any library or code snippet for help.
Thanks in Advance
Blog vs youtube revenues? This is the question.
I have some solutions to IT problems that i know they are unique and unsolved, as i searched all over the place and didn't find answers.
Now i want to share this and i wonder how it would be the best way to do this considering revenues? I guess i can resume it to adsense vs youtube?
I have no ideea which is more profitable, or if there is any other way which i didn't think of and would be more profitable than those two.
Please consider i do not have too much time to invest in for ex finding contracts to advertise independently on my blog let's say. I want to write it (blog), or post it (youtube) and forget about it, or do minimal maintenance (1h/week).
Thank you in advance people :)
Put your ideas on your youtube with a link onscreen and on the description to your blog. This way your ideas will have more visibility.
I'm Alexander Mashkovtsev, student of gymnasium "Akademy", Kyiv, Ukrane. I'm 15.
I'd like to do face recognition program using OpenCV.
I write science work about face recognition, too.
It's very interesting for me, so i search a command.
I'd like to demonstrate the work on Kyiv High-Technology Center to get help with this.
There are people who are ready help me to create this program?
I will be grateful. Also ready to to reward the person who will help me.
Thanks!
have a look at the opencv facereco docs
or, here for a small python demo (yea, i 've seen your other questions here, that's why i'm posting the latter).
but ofc, you want to write your own, if i understood that right, that's great!
It seems that Face++ SDKs are more easier than OpenCV.
You can refer to Face++ website, look through their API docs overview.
Good luck!