Edit to add: I also found someone who recorded a voice chat of the same thing. This isn’t that someone uploaded a song, or that AI didn’t actually process the file. These models really are this sycophantic:

https://m.youtube.com/shorts/JqvDLHshTtI

  • constantturtleaction@lemmy.world
    link
    fedilink
    arrow-up
    0
    ·
    2 months ago

    The reality is that it is a next word prediction machine. There probably aren’t any examples in the training data where people are writing music reviews on something that isn’t music. It probably interprets the sound as best as it can as “music” (and the best it can do this is likely very bad in the first place), and then, since the prompt was about reviewing music, it uses next word prediction to write a music review, which of course turns out looking like a typical music review. It’s not really interpreting the sound as “not music” especially since you told it it IS music.

    • canihasaccount@lemmy.worldOP
      link
      fedilink
      arrow-up
      0
      ·
      2 months ago

      True, but the principle behind the post is the beauty here. When not using the API, it costs these companies an unsustainable amount of money to make their models listen to fart sounds. I don’t use any AI myself, but I support anyone who wants to abuse the flat monthly subscription to make a company burn through money so that a plagiarism model can praise fart sounds.

  • altphoto@lemmy.today
    link
    fedilink
    arrow-up
    0
    ·
    2 months ago

    Its in mp4, usually great for music and video, millions might enjoy listening to music such as the notes contained within this file.

    Should I connect you with a music publisher for a quote? Should I publish to YouTube to monetize rights way…sorry right away?

  • Anisette [any/all]@quokk.au
    link
    fedilink
    English
    arrow-up
    0
    ·
    2 months ago

    I wonder if they actually linked it to an algorithm that analyses the sound that is looking for certain patterns or something, and that is why you get the “atmosphere piece” thing.

      • Anisette [any/all]@quokk.au
        link
        fedilink
        English
        arrow-up
        0
        ·
        2 months ago

        yeah but that doesn’t mean anything, does it? I don’t think they just tokenize the raw audio, that wouldn’t make sense, right?

        • sobchak@programming.dev
          link
          fedilink
          arrow-up
          0
          ·
          2 months ago

          I mean, you could. Just encode 100ms chunks or whatever into tokens then push them through the same model. I’m pretty sure that’s what the claim to do (though with MoE/routing now, maybe).

  • Vinny_93@lemmy.world
    link
    fedilink
    arrow-up
    0
    ·
    2 months ago

    I asked it for a realistic bid for a house, I sent the same text 3 as with 3 different numbers I was thinking of and it told me pretty much the same thing for all of them. I then called it out about it and only then did I receive a decent answer.

  • Wispy2891@lemmy.world
    link
    fedilink
    arrow-up
    0
    ·
    2 months ago

    It’s the same stuff if you ask it to help you master a foreign language in dialogue mode

    • Help me practicing English pronunciation"

    • Sure thing! Now say “travel”

    • *user just belches in the mic*

    • great job! You really aced the accent!