Comparing strings for mutabilityWhat is the difference between String and string in C#?How do I iterate over the words of a string?How do I read / convert an InputStream into a String in Java?Case insensitive 'Contains(string)'How do I make the first letter of a string uppercase in JavaScript?How to replace all occurrences of a string in JavaScriptHow to check whether a string contains a substring in JavaScript?Does Python have a string 'contains' substring method?How do I convert a String to an int in Java?Why is char[] preferred over String for passwords?

How to pronounce 'c++' in Spanish

Is there a word for the censored part of a video?

Why do real positive eigenvalues result in an unstable system? What about eigenvalues between 0 and 1? or 1?

What makes accurate emulation of old systems a difficult task?

How important is it that $TERM is correct?

What is purpose of DB Browser(dbbrowser.aspx) under admin tool?

Mistake in years of experience in resume?

Is Electric Central Heating worth it if using Solar Panels?

Why do games have consumables?

Why did Rep. Omar conclude her criticism of US troops with the phrase "NotTodaySatan"?

Older movie/show about humans on derelict alien warship which refuels by passing through a star

Was Dennis Ritchie being too modest in this quote about C and Pascal?

Injection into a proper class and choice without regularity

How to keep bees out of canned beverages?

Do I need to watch Ant-Man and the Wasp and Captain Marvel before watching Avengers: Endgame?

Island of Knights, Knaves and Spies

Creating a chemical industry from a medieval tech level without petroleum

Why must Chinese maps be obfuscated?

My bank got bought out, am I now going to have to start filing tax returns in a different state?

I preordered a game on my Xbox while on the home screen of my friend's account. Which of us owns the game?

What is the unit of time_lock_delta in LND?

Retract an already submitted recommendation letter (written for an undergrad student)

Combinatorics problem, right solution?

How much cash can I safely carry into the USA and avoid civil forfeiture?



Comparing strings for mutability


What is the difference between String and string in C#?How do I iterate over the words of a string?How do I read / convert an InputStream into a String in Java?Case insensitive 'Contains(string)'How do I make the first letter of a string uppercase in JavaScript?How to replace all occurrences of a string in JavaScriptHow to check whether a string contains a substring in JavaScript?Does Python have a string 'contains' substring method?How do I convert a String to an int in Java?Why is char[] preferred over String for passwords?






.everyoneloves__top-leaderboard:empty,.everyoneloves__mid-leaderboard:empty,.everyoneloves__bot-mid-leaderboard:empty height:90px;width:728px;box-sizing:border-box;








0















I am using a speech transcription engine and it returns results with the associated confidence score that a given result is correct. I would like to look for patterns that show similarities between alternatives (implying a high likelihood that substring was correctly transcribed) and determine which letters/indexes are different amongst alternatives (implying a lower likelihood that they had been correctly transcribed).



For example "price" is the correct transcription and the sequence "ric" would be tagged as likely to be correct:



price,0.8999
perica,0.6221
price,0.7847
perice,0.7208
pricee,0.7843
parice,0.6866
parica,0.5879
pricea,0.7861
brice,0.6957
prize,0.7438
pric,0.8811
terica,0.4677
brica,0.5805
pericee,0.6420
priceee,0.6976
prace,0.7502
trica,0.5805
terice,0.5664
trice,0.6957
prico,0.7828


Also, the alternatives need to be weighted against their confidence score so as to avoid outliers (low confidence score alternatives) from having as significant an impact as high confidence score alternatives.



The point of this is to be able to keep track of letter combinations that the transcription engine struggles with more than others.



Does anyone know of any existing algorithms that could be adapted to such a task?



Thanks










share|improve this question




























    0















    I am using a speech transcription engine and it returns results with the associated confidence score that a given result is correct. I would like to look for patterns that show similarities between alternatives (implying a high likelihood that substring was correctly transcribed) and determine which letters/indexes are different amongst alternatives (implying a lower likelihood that they had been correctly transcribed).



    For example "price" is the correct transcription and the sequence "ric" would be tagged as likely to be correct:



    price,0.8999
    perica,0.6221
    price,0.7847
    perice,0.7208
    pricee,0.7843
    parice,0.6866
    parica,0.5879
    pricea,0.7861
    brice,0.6957
    prize,0.7438
    pric,0.8811
    terica,0.4677
    brica,0.5805
    pericee,0.6420
    priceee,0.6976
    prace,0.7502
    trica,0.5805
    terice,0.5664
    trice,0.6957
    prico,0.7828


    Also, the alternatives need to be weighted against their confidence score so as to avoid outliers (low confidence score alternatives) from having as significant an impact as high confidence score alternatives.



    The point of this is to be able to keep track of letter combinations that the transcription engine struggles with more than others.



    Does anyone know of any existing algorithms that could be adapted to such a task?



    Thanks










    share|improve this question
























      0












      0








      0








      I am using a speech transcription engine and it returns results with the associated confidence score that a given result is correct. I would like to look for patterns that show similarities between alternatives (implying a high likelihood that substring was correctly transcribed) and determine which letters/indexes are different amongst alternatives (implying a lower likelihood that they had been correctly transcribed).



      For example "price" is the correct transcription and the sequence "ric" would be tagged as likely to be correct:



      price,0.8999
      perica,0.6221
      price,0.7847
      perice,0.7208
      pricee,0.7843
      parice,0.6866
      parica,0.5879
      pricea,0.7861
      brice,0.6957
      prize,0.7438
      pric,0.8811
      terica,0.4677
      brica,0.5805
      pericee,0.6420
      priceee,0.6976
      prace,0.7502
      trica,0.5805
      terice,0.5664
      trice,0.6957
      prico,0.7828


      Also, the alternatives need to be weighted against their confidence score so as to avoid outliers (low confidence score alternatives) from having as significant an impact as high confidence score alternatives.



      The point of this is to be able to keep track of letter combinations that the transcription engine struggles with more than others.



      Does anyone know of any existing algorithms that could be adapted to such a task?



      Thanks










      share|improve this question














      I am using a speech transcription engine and it returns results with the associated confidence score that a given result is correct. I would like to look for patterns that show similarities between alternatives (implying a high likelihood that substring was correctly transcribed) and determine which letters/indexes are different amongst alternatives (implying a lower likelihood that they had been correctly transcribed).



      For example "price" is the correct transcription and the sequence "ric" would be tagged as likely to be correct:



      price,0.8999
      perica,0.6221
      price,0.7847
      perice,0.7208
      pricee,0.7843
      parice,0.6866
      parica,0.5879
      pricea,0.7861
      brice,0.6957
      prize,0.7438
      pric,0.8811
      terica,0.4677
      brica,0.5805
      pericee,0.6420
      priceee,0.6976
      prace,0.7502
      trica,0.5805
      terice,0.5664
      trice,0.6957
      prico,0.7828


      Also, the alternatives need to be weighted against their confidence score so as to avoid outliers (low confidence score alternatives) from having as significant an impact as high confidence score alternatives.



      The point of this is to be able to keep track of letter combinations that the transcription engine struggles with more than others.



      Does anyone know of any existing algorithms that could be adapted to such a task?



      Thanks







      string algorithm comparison






      share|improve this question













      share|improve this question











      share|improve this question




      share|improve this question










      asked Mar 9 at 7:44









      Harry StuartHarry Stuart

      412413




      412413






















          0






          active

          oldest

          votes












          Your Answer






          StackExchange.ifUsing("editor", function ()
          StackExchange.using("externalEditor", function ()
          StackExchange.using("snippets", function ()
          StackExchange.snippets.init();
          );
          );
          , "code-snippets");

          StackExchange.ready(function()
          var channelOptions =
          tags: "".split(" "),
          id: "1"
          ;
          initTagRenderer("".split(" "), "".split(" "), channelOptions);

          StackExchange.using("externalEditor", function()
          // Have to fire editor after snippets, if snippets enabled
          if (StackExchange.settings.snippets.snippetsEnabled)
          StackExchange.using("snippets", function()
          createEditor();
          );

          else
          createEditor();

          );

          function createEditor()
          StackExchange.prepareEditor(
          heartbeatType: 'answer',
          autoActivateHeartbeat: false,
          convertImagesToLinks: true,
          noModals: true,
          showLowRepImageUploadWarning: true,
          reputationToPostImages: 10,
          bindNavPrevention: true,
          postfix: "",
          imageUploader:
          brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
          contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
          allowUrls: true
          ,
          onDemand: true,
          discardSelector: ".discard-answer"
          ,immediatelyShowMarkdownHelp:true
          );



          );













          draft saved

          draft discarded


















          StackExchange.ready(
          function ()
          StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f55075147%2fcomparing-strings-for-mutability%23new-answer', 'question_page');

          );

          Post as a guest















          Required, but never shown

























          0






          active

          oldest

          votes








          0






          active

          oldest

          votes









          active

          oldest

          votes






          active

          oldest

          votes















          draft saved

          draft discarded
















































          Thanks for contributing an answer to Stack Overflow!


          • Please be sure to answer the question. Provide details and share your research!

          But avoid


          • Asking for help, clarification, or responding to other answers.

          • Making statements based on opinion; back them up with references or personal experience.

          To learn more, see our tips on writing great answers.




          draft saved


          draft discarded














          StackExchange.ready(
          function ()
          StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f55075147%2fcomparing-strings-for-mutability%23new-answer', 'question_page');

          );

          Post as a guest















          Required, but never shown





















































          Required, but never shown














          Required, but never shown












          Required, but never shown







          Required, but never shown

































          Required, but never shown














          Required, but never shown












          Required, but never shown







          Required, but never shown







          Popular posts from this blog

          1928 у кіно

          Захаров Федір Захарович

          Ель Греко