Edit to add: I also found someone who recorded a voice chat of the same thing. This isn’t that someone uploaded a song, or that AI didn’t actually process the file. These models really are this sycophantic:

https://m.youtube.com/shorts/JqvDLHshTtI

  • Wispy2891@lemmy.world
    link
    fedilink
    arrow-up
    0
    ·
    1 day ago

    It’s the same stuff if you ask it to help you master a foreign language in dialogue mode

    • Help me practicing English pronunciation"

    • Sure thing! Now say “travel”

    • *user just belches in the mic*

    • great job! You really aced the accent!

  • constantturtleaction@lemmy.world
    link
    fedilink
    arrow-up
    0
    ·
    2 days ago

    The reality is that it is a next word prediction machine. There probably aren’t any examples in the training data where people are writing music reviews on something that isn’t music. It probably interprets the sound as best as it can as “music” (and the best it can do this is likely very bad in the first place), and then, since the prompt was about reviewing music, it uses next word prediction to write a music review, which of course turns out looking like a typical music review. It’s not really interpreting the sound as “not music” especially since you told it it IS music.

    • canihasaccount@lemmy.worldOP
      link
      fedilink
      arrow-up
      0
      ·
      2 days ago

      True, but the principle behind the post is the beauty here. When not using the API, it costs these companies an unsustainable amount of money to make their models listen to fart sounds. I don’t use any AI myself, but I support anyone who wants to abuse the flat monthly subscription to make a company burn through money so that a plagiarism model can praise fart sounds.

  • altphoto@lemmy.today
    link
    fedilink
    arrow-up
    0
    ·
    2 days ago

    Its in mp4, usually great for music and video, millions might enjoy listening to music such as the notes contained within this file.

    Should I connect you with a music publisher for a quote? Should I publish to YouTube to monetize rights way…sorry right away?

  • DacoTaco@lemmy.world
    link
    fedilink
    arrow-up
    0
    ·
    2 days ago

    Oh fuck, this gives me the idea to (locally) train an ai on fart noises and tell it its a way of communication to learn

  • taiyang@lemmy.world
    link
    fedilink
    arrow-up
    0
    ·
    2 days ago

    Quiet city montage… lol.

    I’m old enough to remember an April Fools Day bit Adult Swim once did where every show just had consistent, random fart sounds including serious anime like Ghost in the Shell. I’m certain this LLM would approve of such a work of (f)art.

    • wizardbeard@lemmy.dbzer0.com
      link
      fedilink
      English
      arrow-up
      0
      ·
      2 days ago

      I don’t know if I’ll ever find it again, but there’s a “restored” version of that GiTS episode up on the internet archive. Someone took a bluray copy of the video and audio, and mixed in the relevant fart sounds from the audio of a VHS recording of the adult swim edit that was tracked down.

    • ouRKaoS@lemmy.today
      link
      fedilink
      arrow-up
      0
      ·
      2 days ago

      The year they drew moustaches randomly on everyone was my favorite.

      Faye Valentine from Cowboy Bebop had an epic one!

  • Anisette [any/all]@quokk.au
    link
    fedilink
    English
    arrow-up
    0
    ·
    2 days ago

    I wonder if they actually linked it to an algorithm that analyses the sound that is looking for certain patterns or something, and that is why you get the “atmosphere piece” thing.

      • Anisette [any/all]@quokk.au
        link
        fedilink
        English
        arrow-up
        0
        ·
        2 days ago

        yeah but that doesn’t mean anything, does it? I don’t think they just tokenize the raw audio, that wouldn’t make sense, right?

        • sobchak@programming.dev
          link
          fedilink
          arrow-up
          0
          ·
          2 days ago

          I mean, you could. Just encode 100ms chunks or whatever into tokens then push them through the same model. I’m pretty sure that’s what the claim to do (though with MoE/routing now, maybe).