Comparing strings for mutabilityWhat is the difference between String and string in C#?How do I iterate over the words of a string?How do I read / convert an InputStream into a String in Java?Case insensitive 'Contains(string)'How do I make the first letter of a string uppercase in JavaScript?How to replace all occurrences of a string in JavaScriptHow to check whether a string contains a substring in JavaScript?Does Python have a string 'contains' substring method?How do I convert a String to an int in Java?Why is char[] preferred over String for passwords?
How to pronounce 'c++' in Spanish
Is there a word for the censored part of a video?
Why do real positive eigenvalues result in an unstable system? What about eigenvalues between 0 and 1? or 1?
What makes accurate emulation of old systems a difficult task?
How important is it that $TERM is correct?
What is purpose of DB Browser(dbbrowser.aspx) under admin tool?
Mistake in years of experience in resume?
Is Electric Central Heating worth it if using Solar Panels?
Why do games have consumables?
Why did Rep. Omar conclude her criticism of US troops with the phrase "NotTodaySatan"?
Older movie/show about humans on derelict alien warship which refuels by passing through a star
Was Dennis Ritchie being too modest in this quote about C and Pascal?
Injection into a proper class and choice without regularity
How to keep bees out of canned beverages?
Do I need to watch Ant-Man and the Wasp and Captain Marvel before watching Avengers: Endgame?
Island of Knights, Knaves and Spies
Creating a chemical industry from a medieval tech level without petroleum
Why must Chinese maps be obfuscated?
My bank got bought out, am I now going to have to start filing tax returns in a different state?
I preordered a game on my Xbox while on the home screen of my friend's account. Which of us owns the game?
What is the unit of time_lock_delta in LND?
Retract an already submitted recommendation letter (written for an undergrad student)
Combinatorics problem, right solution?
How much cash can I safely carry into the USA and avoid civil forfeiture?
Comparing strings for mutability
What is the difference between String and string in C#?How do I iterate over the words of a string?How do I read / convert an InputStream into a String in Java?Case insensitive 'Contains(string)'How do I make the first letter of a string uppercase in JavaScript?How to replace all occurrences of a string in JavaScriptHow to check whether a string contains a substring in JavaScript?Does Python have a string 'contains' substring method?How do I convert a String to an int in Java?Why is char[] preferred over String for passwords?
.everyoneloves__top-leaderboard:empty,.everyoneloves__mid-leaderboard:empty,.everyoneloves__bot-mid-leaderboard:empty height:90px;width:728px;box-sizing:border-box;
I am using a speech transcription engine and it returns results with the associated confidence score that a given result is correct. I would like to look for patterns that show similarities between alternatives (implying a high likelihood that substring was correctly transcribed) and determine which letters/indexes are different amongst alternatives (implying a lower likelihood that they had been correctly transcribed).
For example "price"
is the correct transcription and the sequence "ric"
would be tagged as likely to be correct:
price,0.8999
perica,0.6221
price,0.7847
perice,0.7208
pricee,0.7843
parice,0.6866
parica,0.5879
pricea,0.7861
brice,0.6957
prize,0.7438
pric,0.8811
terica,0.4677
brica,0.5805
pericee,0.6420
priceee,0.6976
prace,0.7502
trica,0.5805
terice,0.5664
trice,0.6957
prico,0.7828
Also, the alternatives need to be weighted against their confidence score so as to avoid outliers (low confidence score alternatives) from having as significant an impact as high confidence score alternatives.
The point of this is to be able to keep track of letter combinations that the transcription engine struggles with more than others.
Does anyone know of any existing algorithms that could be adapted to such a task?
Thanks
string algorithm comparison
add a comment |
I am using a speech transcription engine and it returns results with the associated confidence score that a given result is correct. I would like to look for patterns that show similarities between alternatives (implying a high likelihood that substring was correctly transcribed) and determine which letters/indexes are different amongst alternatives (implying a lower likelihood that they had been correctly transcribed).
For example "price"
is the correct transcription and the sequence "ric"
would be tagged as likely to be correct:
price,0.8999
perica,0.6221
price,0.7847
perice,0.7208
pricee,0.7843
parice,0.6866
parica,0.5879
pricea,0.7861
brice,0.6957
prize,0.7438
pric,0.8811
terica,0.4677
brica,0.5805
pericee,0.6420
priceee,0.6976
prace,0.7502
trica,0.5805
terice,0.5664
trice,0.6957
prico,0.7828
Also, the alternatives need to be weighted against their confidence score so as to avoid outliers (low confidence score alternatives) from having as significant an impact as high confidence score alternatives.
The point of this is to be able to keep track of letter combinations that the transcription engine struggles with more than others.
Does anyone know of any existing algorithms that could be adapted to such a task?
Thanks
string algorithm comparison
add a comment |
I am using a speech transcription engine and it returns results with the associated confidence score that a given result is correct. I would like to look for patterns that show similarities between alternatives (implying a high likelihood that substring was correctly transcribed) and determine which letters/indexes are different amongst alternatives (implying a lower likelihood that they had been correctly transcribed).
For example "price"
is the correct transcription and the sequence "ric"
would be tagged as likely to be correct:
price,0.8999
perica,0.6221
price,0.7847
perice,0.7208
pricee,0.7843
parice,0.6866
parica,0.5879
pricea,0.7861
brice,0.6957
prize,0.7438
pric,0.8811
terica,0.4677
brica,0.5805
pericee,0.6420
priceee,0.6976
prace,0.7502
trica,0.5805
terice,0.5664
trice,0.6957
prico,0.7828
Also, the alternatives need to be weighted against their confidence score so as to avoid outliers (low confidence score alternatives) from having as significant an impact as high confidence score alternatives.
The point of this is to be able to keep track of letter combinations that the transcription engine struggles with more than others.
Does anyone know of any existing algorithms that could be adapted to such a task?
Thanks
string algorithm comparison
I am using a speech transcription engine and it returns results with the associated confidence score that a given result is correct. I would like to look for patterns that show similarities between alternatives (implying a high likelihood that substring was correctly transcribed) and determine which letters/indexes are different amongst alternatives (implying a lower likelihood that they had been correctly transcribed).
For example "price"
is the correct transcription and the sequence "ric"
would be tagged as likely to be correct:
price,0.8999
perica,0.6221
price,0.7847
perice,0.7208
pricee,0.7843
parice,0.6866
parica,0.5879
pricea,0.7861
brice,0.6957
prize,0.7438
pric,0.8811
terica,0.4677
brica,0.5805
pericee,0.6420
priceee,0.6976
prace,0.7502
trica,0.5805
terice,0.5664
trice,0.6957
prico,0.7828
Also, the alternatives need to be weighted against their confidence score so as to avoid outliers (low confidence score alternatives) from having as significant an impact as high confidence score alternatives.
The point of this is to be able to keep track of letter combinations that the transcription engine struggles with more than others.
Does anyone know of any existing algorithms that could be adapted to such a task?
Thanks
string algorithm comparison
string algorithm comparison
asked Mar 9 at 7:44
Harry StuartHarry Stuart
412413
412413
add a comment |
add a comment |
0
active
oldest
votes
Your Answer
StackExchange.ifUsing("editor", function ()
StackExchange.using("externalEditor", function ()
StackExchange.using("snippets", function ()
StackExchange.snippets.init();
);
);
, "code-snippets");
StackExchange.ready(function()
var channelOptions =
tags: "".split(" "),
id: "1"
;
initTagRenderer("".split(" "), "".split(" "), channelOptions);
StackExchange.using("externalEditor", function()
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled)
StackExchange.using("snippets", function()
createEditor();
);
else
createEditor();
);
function createEditor()
StackExchange.prepareEditor(
heartbeatType: 'answer',
autoActivateHeartbeat: false,
convertImagesToLinks: true,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: 10,
bindNavPrevention: true,
postfix: "",
imageUploader:
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
,
onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
);
);
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function ()
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f55075147%2fcomparing-strings-for-mutability%23new-answer', 'question_page');
);
Post as a guest
Required, but never shown
0
active
oldest
votes
0
active
oldest
votes
active
oldest
votes
active
oldest
votes
Thanks for contributing an answer to Stack Overflow!
- Please be sure to answer the question. Provide details and share your research!
But avoid …
- Asking for help, clarification, or responding to other answers.
- Making statements based on opinion; back them up with references or personal experience.
To learn more, see our tips on writing great answers.
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function ()
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f55075147%2fcomparing-strings-for-mutability%23new-answer', 'question_page');
);
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown