Skip to content
  • Categories
  • Recent
  • Tags
  • All Topics
  • Popular
  • World
  • Users
  • Groups
Skins
  • Light
  • Brite
  • Cerulean
  • Cosmo
  • Flatly
  • Journal
  • Litera
  • Lumen
  • Lux
  • Materia
  • Minty
  • Morph
  • Pulse
  • Sandstone
  • Simplex
  • Sketchy
  • Spacelab
  • United
  • Yeti
  • Zephyr
  • Dark
  • Cyborg
  • Darkly
  • Quartz
  • Slate
  • Solar
  • Superhero
  • Vapor

  • Default (No Skin)
  • No Skin
Collapse
Caint logo. It's just text.
  1. Home
  2. Technology
  3. OK. I'm at wit's end attempting to convince Google's LLM to pronounce an English name correctly.
Welcome to Caint!

Issues? Post in Comments & Feedback
You can now view, reply, and favourite posts from the Fediverse. You can click here or click on the on the navigation bar on the left.

OK. I'm at wit's end attempting to convince Google's LLM to pronounce an English name correctly.

Scheduled Pinned Locked Moved Technology
technology
23 Posts 11 Posters 6 Views
  • Oldest to Newest
  • Newest to Oldest
  • Most Votes
Reply
  • Reply as topic
Log in to reply
This topic has been deleted. Only users with topic management privileges can see it.
  • PowderhornP Powderhorn

    I know IPA (the linguistic term, not the beer … OK, I also know the beer, but that’s not important right now) … and, yeah, I tried that, but on a laptop without a numpad, it’s a bit of a slog.

    What was maddening was the LLM got it right somewhere around 10% of the time after I corrected it. This was a voice conversation, so every time I corrected it, that should have been clear data. Aren’t these systems simply supposed to be pattern recognition? How is it outputting wildly different pronunciations (N>5) with constant inputs?

    H This user is from outside of this forum
    H This user is from outside of this forum
    howrar@lemmy.ca
    wrote last edited by howrar@lemmy.ca
    #21

    I’m pretty sure whatever voice system you’re using is just transcribing things to text and feeding it into an LLM, so it wouldn’t actually have that audio data. I’m not aware of any audio equivalent of LLMs existing.

    PowderhornP 1 Reply Last reply
    2
    • H howrar@lemmy.ca

      I’m pretty sure whatever voice system you’re using is just transcribing things to text and feeding it into an LLM, so it wouldn’t actually have that audio data. I’m not aware of any audio equivalent of LLMs existing.

      PowderhornP This user is from outside of this forum
      PowderhornP This user is from outside of this forum
      Powderhorn
      wrote last edited by powderhorn@beehaw.org
      #22

      The equivalent is NLP (natural language processing), which was already a huge research area in the '90s. In fact, had I not been a fucking idiot and caught the journalism bug, with my studies in CS and linguistics, I’d likely be doing quite well.

      This said, that was about voice input being converted to text – e.g., Dragon Naturally Speaking – but apparently little progress has been made going in the other direction. NotebookLM had other weird glitches where standard English words get weird vowels some 5% of the time.

      1 Reply Last reply
      0
      • PowderhornP Powderhorn

        Seriously, 15 times is my limit on correcting an LLM.

        The name in question? Rach. Google absolutely cannot pronounce it in any other way than assuming I was referring to Louise Fletcher in the diminutive.

        Specifying “long a” did nothing, and now I’m past livid. If you can’t handle a common English name, why would I trust you with anything else?

        This is my breaking point with LLMs. They’re fucking idiotic and can’t learn how to pronounce English words auf Englisch.

        I hope the VCs also die in a fire.

        🇰 🌀 🇱 🇦 🇳 🇦 🇰 🇮 K This user is from outside of this forum
        🇰 🌀 🇱 🇦 🇳 🇦 🇰 🇮 K This user is from outside of this forum
        🇰 🌀 🇱 🇦 🇳 🇦 🇰 🇮
        wrote last edited by
        #23

        Short for Rachel?

        Ray-tch. Rachel, but without the “el.”

        1 Reply Last reply
        0
        Reply
        • Reply as topic
        Log in to reply
        • Oldest to Newest
        • Newest to Oldest
        • Most Votes


        • Login

        • Don't have an account? Register

        • Login or register to search.
        • First post
          Last post
        0
        • Categories
        • Recent
        • Tags
        • All Topics
        • Popular
        • World
        • Users
        • Groups