How do I start a streaming service? - stream

I know it is an open question, but I'll like the input from the community on where to start. I know the basics but my strength is on the front end side of things, so this a bit out of my field, but I want to learn.
What I want is to create a streaming business that can run 24/7 with a payed fee to see the feed from any of the cameras I own.
I know I'm gonna need a paid Live Website Streaming Video Platform (or create my own, don't think this will be an option).
Any tips you may have a re welcome, thanks.

Related

YouTube Studio Multiple Camera Angles

YouTube recently forced everyone to the new YouTube studio for live streaming. Per their documentation here https://support.google.com/youtube/answer/2853812?hl=en the only place to setup Multiple Camera angles is on the Events page. However, since their change I can't access Classic Streaming anymore, it just pop ups a message saying
Live Control Room is the new way to go live.
I've tried reaching out to YouTube directly - I'm stuck in a support loop of uselessness. Has anyone else seen this and found a workaround?
Direct email from YouTube:
I just want to make all a clarification regarding your concern why you can't find to stream with multiple camera angles.
Due because of low usage and since it can't be watched on mobile,
multi-camera won't migrate to Live Control Room right now. Rest
assured we are looking for a better version of this product in the
near future.
We also recommend you to send a feedback to our Product team so they
can look into your suggestion.
Thanks for your understanding on this matter. Let me know if you have
other questions.

All factors when live streaming in Swift?

I am attempting to make my app, this is an app that will integrate live streaming that could be accessed by anyone that is signed up. I have been searching and the information is very vague.
What factors do I have to take into consideration to make an IOS app with live streaming?
I would have no clue how to implement any of this and I haven't seen much documentation about how to implement this. What third party software would I have to use, would I need to create something in a different programming language?
This app is going to be only for IOS for now.
Thanks,
The question is, how many users? How much do you want to spend? How critical is performance/quality/downtime, etc.? Is this a business app?
Due to having multiple users connect to one stream, I would suggest that the streamer streams it to a server, whereby the server deals with the bandwidth requirements to clients, allowing for it to more easily scale. If it is a one-on-one or up to maybe one on three, you could still think of p2p for a social stream (e.g. streaming to family) but not e.g. a business conference (unless you want them to have e2e encryption, avoiding server, etc. which is usually complete overkill, though do implement encryption if possible, quite easy using libraries).
Now if you're asking coding wise, look for some tutorials, e.g. https://www.raywenderlich.com/5191-video-streaming-tutorial-for-ios-getting-started is quite nice, it's quite straight forward and will probably do what you want for the client accessing it. Use the AVFoundation Playback https://developer.apple.com/av-foundation/ it has links to HTTP live streaming and how to implement it as well.

shazam for voice recognition on iphone

I am trying to build an app that allows the user to record individual people speaking, and then save the recordings on the device and tag each record with the name of the person who spoke. Then there is the detection mode, in which i record someone and can tell whats his name if he is in the local database.
First of all - is this possible at all? I am very new to iOS development and not so familiar with the available APIs.
More importantly, which API should I use (ideally free) to correlate between the incoming voice and the records I have in the local db? This should behave something like Shazam, but much more simple since the database I am looking for a match against is much smaller.
If you're new to iOS development, I'd start with the core app to record the audio and let people manually choose a profile/name to attach it to and worry about the speaker recognition part later.
You obviously have two options for the recognition side of things: You can either tie in someone else's speech authentication/speaker recognition library (which will probably be in C or C++), or you can try to write your own.
How many people are going to use your app? You might be able to create something basic yourself: If it's the difference between a man and a woman you could probably figure that out by doing an FFT spectral analysis of the audio and figure out where the frequency peaks are. Obviously the frequencies used to enunciate different phonemes are going to vary somewhat, so solving the general case for two people who sound fairly similar is probably hard. You'll need to train the system with a bunch of text and build some kind of model of frequency distributions. You could try to do clustering or something, but you're going to run into a fair bit of maths fairly quickly (gaussian mixture models, et al). There are libraries/projects that'll do this. You might be able to port this from matlab, for example: https://github.com/codyaray/speaker-recognition
If you want to take something off-the-shelf, I'd go with a straight C library like mistral, as it should be relatively easy to call into from Objective-C.
The SpeakHere sample code should get you started for audio recording and playback.
Also, it may well take longer for the user to train your app to recognise them than it's worth in time-saving from just picking their name from a list. Unless you're intending their voice to be some kind of security passport type thing, it might just not be worth bothering with.

Any good (free) text-to-speech engines out there?

I've been scouring the SO board and google and can't find any really good recommendations for this. I'm building a Twilio application and the text-to-speech (TTS) engine is way bad. Plus, it's a pain in the ass to test since I have to deploy every time. Is there a significantly better resource out there that could render to a WAV or MP3 file so I can save and use that instead? Maybe there's a great API for this somewhere. I just want to avoid recording 200 MP3 files myself, would rather have this generated programatically...
Things I've seen and rejected:
http://www.yakitome.com/ (I couldn't force myself to give them my email)
http://www2.research.att.com/~ttsweb/tts/demo.php
http://www.naturalreaders.com/index.htm
http://www.panopreter.com/index.php (on the basis of crappy website)
Thinking of paying for this, but not sure yet: https://ondemand.neospeech.com/
Obviously I'm new to this, if I'm missing something obvious, please point it out...
I am not sure if you have access to a mac computer or not. Mac has pretty advanced tts built into the operating system. Apple spent a lot of money on top engineers to research it. It can easily be controlled and even automated from the command prompt. It also has quite a few built in voices to choose from. That is what I used on a recent phone system I put up. But I realize that this is not an option if you don't have a mac.
Another one you might want to check into is http://cepstral.com/ they have very realistic voices. I think they used to be open source but they are no longer and now you need to pay licensing fees. They are very commonly used for high end commercial applications. And are not so much geared towards the home user that wants their article read to them.
I like the YAKiToMe! website the best. It's free and the voices are top quality. In case you're still worried about giving them your email, they've never spammed me in many years of use and I never got onto any spam lists after signing up with them, so I doubt they sold my email. Anyway, the service is great and has lots of features for turning electronic text into audio files in different languages.
As for the API you're looking for, YAKiToMe! has a well-documented API and it's free to use. You have to register with the site to use it, but that's because it lets you customize pronunciation and voice selection, so it needs to differentiate you from other users.

How to add video chat capability to my rails application?

I've been googling about how to accomplish oneline chat for rails application and I've implemented a text chat version using Juggernaut. But for video chat, I only find flash&red5 might be a solution for me, but it's flash and java, which I'm relatively not good at. Is there a better solution?
I have researched this a little and the options are not great.
There are some nice-looking packages that cost real money. Tokbox.com, ooVoo (http://www.oovoo.com, looks free at first but free version has limits), VideoWhisper.com looks interesting -- they seem to sell a 2-way video chat package for a one-time fee, but it is PHP-based.
A simple Flash-based solution is definitely a good way to go. Flash has good video support and virtually everyone has it installed already. It's not that hard to learn enough Flash basics to do a simple 2-way video chat (see http://www.derekentringer.com/blog/fms-video-chat/ for an example of a trivial video chat script that is something like 30 lines of code). And you don't need to learn Java to use Red5 unless you want to customize it -- Red5 is the open-source video streaming server that makes it so you don't need to buy really a expensive Adobe Media Streaming server system. You just need to learn enough to set it up and get it running.
I'm certain there's an open-source or low-cost Flash script out there that handles a basic 2-way video chat, but I have yet to find it. If anyone does, please post it!
Here is a rails implementation of tokbox:
http://github.com/njacobeus/tokboxer/
Try using Raydash. There is a rails gem available at https://github.com/gersh/Raydash-Ruby-on-Rails. You just need to register at http://www.raydash.com to use it.

Resources