Isolate the speech of two people in an audio record with two people talking [closed]

I would like to find a way to isolate the speech of each person in an audio recording so that I can create a file of this form:



[
    {
        "voice_fingerprint": "701066EDD3A0A40A2F53F61EAFD0E6AB",
        "sentences": [
            {
                "sentence": "do you like red apples",
                "position": 1.39 // Seconds. Time position in the audio record
            },
            {
                "sentence": "and how do you feel about time shifts",
                "position": 7.21
            }
        ]
    },
    {
        "voice_fingerprint": "8FFEA051AF3E3FB9A80A51A98FE05896",
        "sentences": [
            {
                "sentence": "yes I do like them",
                "position": 4.54
            },
            {
                "sentence": "i feel well about traveling",
                "position": 10.18
            }
        ]
    }
]


The recording might be, for example, an interview.



The problem is NOT the speech-to-text step, but separating the two people's sentences so that we know which person spoke which words, and at what time.
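
To make the target concrete, here is a minimal sketch of only the last step: grouping speaker-labelled, time-stamped sentences into the structure described in this question. It assumes that some earlier step (usually called speaker diarization, combined with speech-to-text, neither shown here) has already produced `(speaker_id, start_seconds, text)` tuples. The function name `group_by_speaker` and the tuple layout are hypothetical, chosen for illustration.

```python
import json


def group_by_speaker(segments):
    """Group (speaker_id, start_seconds, text) tuples by speaker.

    The input layout is an assumption for this sketch: it stands in for
    the output of a diarization + transcription step that is not shown.
    """
    grouped = {}
    # Sort by start time so each speaker's sentences stay chronological.
    for speaker_id, start, text in sorted(segments, key=lambda s: s[1]):
        grouped.setdefault(speaker_id, []).append(
            {"sentence": text, "position": start}
        )
    # Dicts keep insertion order, so speakers appear in order of first speech.
    return [
        {"voice_fingerprint": speaker_id, "sentences": sentences}
        for speaker_id, sentences in grouped.items()
    ]


# Hypothetical input, reusing the values from the example file above.
segments = [
    ("8FFEA051AF3E3FB9A80A51A98FE05896", 4.54, "yes I do like them"),
    ("701066EDD3A0A40A2F53F61EAFD0E6AB", 1.39, "do you like red apples"),
    ("8FFEA051AF3E3FB9A80A51A98FE05896", 10.18, "i feel well about traveling"),
    ("701066EDD3A0A40A2F53F61EAFD0E6AB", 7.21, "and how do you feel about time shifts"),
]

result = group_by_speaker(segments)
print(json.dumps(result, indent=4))
```

The hard part of the problem, deciding which speaker produced each stretch of audio, is deliberately left out; the sketch only fixes the shape of the expected result.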



Preferably in Python.



  • Have you ever worked on this?

  • Do you have any hints?









closed as too broad by DavidG, Jesper Juhl, Neil Butterworth, Idlehands, Prune Mar 7 at 0:09


Please edit the question to limit it to a specific problem with enough detail to identify an adequate answer. Avoid asking multiple distinct questions at once. See the How to Ask page for help clarifying this question. If this question can be reworded to fit the rules in the help center, please edit the question.


















  • This question is way too broad to get an actual answer. For example, are you talking about isolating the sentences after they are done talking, or trying to isolate the voices in the audio file and then running speech-to-text on them?

    – Edeki Okoh
    Mar 6 at 16:31















python artificial-intelligence audio-processing natural-language-processing






asked Mar 6 at 16:13 by Flavien B., edited Mar 6 at 16:38



