Isolate the speech of two people in an audio record with two people talking [closed]


-4

I would like to find a way to isolate the speech of each person in an audio recording, so that I can create a file of this form:

    [
        {
            "voice_fingerprint": "701066EDD3A0A40A2F53F61EAFD0E6AB",
            "sentences": [
                {
                    "sentence": "do you like red apples",
                    "position": 1.39 // Seconds. Time position in the audio record
                },
                {
                    "sentence": "and how do you feel about time shifts",
                    "position": 7.21
                }
            ]
        },
        {
            "voice_fingerprint": "8FFEA051AF3E3FB9A80A51A98FE05896",
            "sentences": [
                {
                    "sentence": "yes I do like them",
                    "position": 4.54
                },
                {
                    "sentence": "i feel well about traveling",
                    "position": 10.18
                }
            ]
        }
    ]
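
To make the target format concrete, here is a minimal sketch of how such a file could be assembled once each sentence has already been attributed to a speaker. The input list `utterances`, the function name `build_report`, and the idea that a fingerprint string is already available per sentence are assumptions for illustration only; how to obtain them is the open part of the question. Strict JSON does not allow `//` comments, so the unit (seconds) is noted in the Python code instead.

```python
import json
from collections import defaultdict


def build_report(utterances):
    """Group (fingerprint, sentence, position_in_seconds) tuples by speaker."""
    by_speaker = defaultdict(list)
    for fingerprint, sentence, position in utterances:
        by_speaker[fingerprint].append(
            {"sentence": sentence, "position": position}  # position in seconds
        )

    report = []
    for fingerprint, sentences in by_speaker.items():
        # Keep each speaker's sentences in chronological order.
        sentences.sort(key=lambda s: s["position"])
        report.append({"voice_fingerprint": fingerprint, "sentences": sentences})

    # Order speakers by the time of their first sentence.
    report.sort(key=lambda entry: entry["sentences"][0]["position"])
    return report


# Hypothetical, already-attributed input (values taken from the example above).
utterances = [
    ("701066EDD3A0A40A2F53F61EAFD0E6AB", "do you like red apples", 1.39),
    ("8FFEA051AF3E3FB9A80A51A98FE05896", "yes I do like them", 4.54),
    ("701066EDD3A0A40A2F53F61EAFD0E6AB", "and how do you feel about time shifts", 7.21),
    ("8FFEA051AF3E3FB9A80A51A98FE05896", "i feel well about traveling", 10.18),
]

if __name__ == "__main__":
    print(json.dumps(build_report(utterances), indent=4))
```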

This may be an interview recording.

The problem is NOT the speech-to-text step, but isolating each person's sentences, so that we know which person spoke which word and at what time.

Preferably in Python.
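
The task of working out "who spoke when" is usually called speaker diarization. To illustrate the attribution step I have in mind (not the diarization itself), here is a rough sketch that labels each timestamped word with the speaker turn it overlaps most. Everything in it is an assumption: `turns` stands in for the output of some diarization step, `words` for a speech-to-text result with word timestamps, and the speaker labels are placeholders.

```python
def overlap(a_start, a_end, b_start, b_end):
    """Length in seconds of the intersection of two time intervals (0 if disjoint)."""
    return max(0.0, min(a_end, b_end) - max(a_start, b_start))


def assign_speakers(words, turns):
    """Label each (text, start, end) word with the speaker turn overlapping it most.

    Words that overlap no turn at all get speaker None.
    """
    labelled = []
    for text, w_start, w_end in words:
        best_speaker, best_overlap = None, 0.0
        for speaker, t_start, t_end in turns:
            shared = overlap(w_start, w_end, t_start, t_end)
            if shared > best_overlap:
                best_speaker, best_overlap = speaker, shared
        labelled.append({"word": text, "start": w_start, "speaker": best_speaker})
    return labelled


# Hypothetical inputs: speaker turns (label, start, end) and words (text, start, end).
turns = [("speaker_A", 1.0, 4.0), ("speaker_B", 4.0, 7.0)]
words = [
    ("do", 1.39, 1.50),
    ("you", 1.55, 1.70),
    ("yes", 4.54, 4.80),
    ("hm", 8.00, 8.20),  # falls outside every turn
]

if __name__ == "__main__":
    for item in assign_speakers(words, turns):
        print(item)
```

Choosing the turn with the largest overlap is only one possible rule; words that straddle a speaker change or fall in overlapping speech would need more care.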

Have you ever worked on this?

Do you have any hints?

edited Mar 6 at 16:38

asked Mar 6 at 16:13

Flavien B.


closed as too broad by DavidG, Jesper Juhl, Neil Butterworth, Idlehands, Prune Mar 7 at 0:09

Please edit the question to limit it to a specific problem with enough detail to identify an adequate answer. Avoid asking multiple distinct questions at once. See the How to Ask page for help clarifying this question. If this question can be reworded to fit the rules in the help center, please edit the question.

This question is way too broad to get an actual answer. For example, are you talking about isolating the sentences after they are done talking, or trying to isolate the voices in the audio file and then sending them to speech-to-text?

– Edeki Okoh
Mar 6 at 16:31


python artificial-intelligence audio-processing natural-language-processing
