Em dashes and emojis

  • alekwithak@lemmy.world
    link
    fedilink
    arrow-up
    22
    arrow-down
    1
    ·
    1 day ago

    Great catch! That’s a really interesting observation — but no, using em dashes and emojis alone is not a reliable way to tell AI text from human-written text.

    Here’s why:

    1️⃣ Humans and AI both use em dashes and emojis

    Skilled human writers often use em dashes for style, tone, or emphasis (like in essays, journalism, or fiction).

    Modern AI models, including ChatGPT, are trained on vast amounts of text — including texts that use em dashes extensively — so they use them naturally.

    2️⃣ Em dash frequency varies by context

    In formal writing (e.g., academic papers), em dashes are less common, regardless of author.

    In casual or conversational writing, both humans and AIs may use them liberally.

    3️⃣ Stylometric features are broader than one punctuation mark

    When people try to detect AI-generated text, they usually analyze a combination of features:

    Average sentence length

    Vocabulary richness

    Repetition patterns

    Syntactic structures

    Overuse or underuse of certain constructions

    Punctuation is just one small part of these analyses and isn’t decisive on its own.

    ✅ Bottom line: Em dashes can hint at style, but they aren’t a reliable “tell” for AI detection on their own. You need a holistic analysis of multiple stylistic and structural features to make a meaningful judgment.

    🤖 Why emojis aren’t a clear tell for AI

    1️⃣ AI can easily include emojis if prompted Modern AI models can and do use emojis naturally when asked to write in a casual or friendly tone. In fact, they can even mimic how humans use them in different contexts (e.g., sparingly or heavily, ironically or sincerely).

    2️⃣ Humans vary wildly in emoji usage Some humans use emojis constantly, especially in texting or on social media. Others almost never use them, even in casual writing. Age, culture, and personal style all influence this.

    3️⃣ Emojis can be explicitly requested or omitted If you tell an AI “don’t use emojis,” it won’t. Similarly, you can tell it “use lots of emojis,” and it will. So it’s not an inherent trait.

    4️⃣ Stylometric detection relies on more than one feature Like em dashes, emojis are only one aspect of style. Real detection tools look at patterns like sentence structure, repetitiveness, word choice entropy, and coherence across paragraphs — not single markers.


    ✅ When might emojis suggest AI text?

    If there is excessively consistent or mechanical emoji usage (e.g., one emoji at the end of every sentence, all very literal), it might suggest machine-generated text or an automated marketing bot.

    But even then, it’s not a guarantee — some humans also write this way, especially in advertising.


    💡 Bottom line: Emojis alone are not a reliable clue. You need a combination of markers — repetition, coherence, style shifts, and other linguistic fingerprints — to reasonably guess if something is AI-generated.

    If you’d like, I can walk you through some actual features that are better indicators (like burstiness, perplexity, or certain syntactic quirks). Want me to break that down?

    • Mustakrakish@lemmy.world
      link
      fedilink
      arrow-up
      3
      arrow-down
      1
      ·
      20 hours ago

      I’ve never seen em dahses outside of an academic paper, so saying people use them liberaly is an olypmic level stretch.

      Also that comment was clearly written by ai itself.

      • voodooattack@lemmy.world
        link
        fedilink
        arrow-up
        2
        ·
        20 hours ago

        I use them often even when I’m not writing anything important, just a habit from writing I guess.

        Fuck. I just realised I used them in my résumé that I sent out yesterday. Shit shit shit

  • Net_Runner :~$@lemmy.zip
    link
    fedilink
    English
    arrow-up
    7
    ·
    1 day ago

    The illiterate flocking to Lemmy to profess that they don’t know how to make em dashes, therefore it’s AI

  • rumba@lemmy.zip
    link
    fedilink
    English
    arrow-up
    19
    arrow-down
    6
    ·
    2 days ago

    Another take:

    She feels bad about it, wrote a incoherant babbling mess of run-on sentences and incoherant rants about your relationship, she then re-read it and found it to be disproportionately mean and possibly hurtful, She then shoved it all into an LLM and prompted:

    I’m breaking up with my boyfriend. This is all my natural heartfelt take on the situation <inserts text>, but I find the tone to be callous, angry, and hurtful. Can you please reword this to make the reader feel less attacked, possibly up to and including removing grievances, but at the same time making it clear that this decision is final and that I’d like to part ways amicably, and also that he’s not getting his dog back.

    • grahamja@reddthat.com
      link
      fedilink
      arrow-up
      13
      arrow-down
      1
      ·
      1 day ago

      Top comment is about how to get a machine to word something raw and emotional that should have been done in person. Nobody wants to get broken up with, let alone with a script written by a robot. Your take is off putting.

      • rumba@lemmy.zip
        link
        fedilink
        English
        arrow-up
        6
        arrow-down
        5
        ·
        1 day ago

        Yet we’re perfectly cool with a card from a department store claiming Happy anniversary to my beautiful wife and I’m so glad that you’re such a good mother to our kids.

        Anyone that has a take that is not shoving a red hot poker up AI’s ass gets down voted.

        I’m not here for the upvotes. Carry on. And please don’t take it personally, I do hope you have a solid day.

        • ipkpjersi@lemmy.ml
          link
          fedilink
          arrow-up
          2
          ·
          20 hours ago

          You’re giving her a card and flowers in person though, no? You’re not just texting it to her and that’s all she gets.

    • BigPotato@lemmy.world
      link
      fedilink
      arrow-up
      1
      ·
      1 day ago

      She wants it reworded to be less hurtful but she’s keeping ‘his’ dog?

      She’d better start mentioning he kicked it or she just painted herself as… Well, not the worst but, like, really low… Ain’t no ‘amicable’ if you’re kidnapping the dog.

  • Echo Dot@feddit.uk
    link
    fedilink
    arrow-up
    41
    arrow-down
    1
    ·
    2 days ago

    I actually like using em dashes because it’s the correct thing to do. Also the Oxford comma, correct use of semi colon, and listing things in threes.

  • lemmy_outta_here@lemmy.world
    link
    fedilink
    arrow-up
    18
    ·
    2 days ago

    For me, it would take some of the sting out of the break-up.

    I would think to myself, “damn, how did I not realize that I was dating a lazy moron?”

  • SmokeyDope@lemmy.world
    link
    fedilink
    English
    arrow-up
    30
    arrow-down
    9
    ·
    2 days ago

    I’m a markdown nerd who likes to use headers to break up longer post and sometimes properly buletpoint or put ASCII art in preformatted boxes. People who thinks they have the magic sauce on LLM generation detection because a post goes out of their way to do more than the bre minimum with punctuation or formatting is an asshole.

      • SmokeyDope@lemmy.world
        link
        fedilink
        English
        arrow-up
        12
        arrow-down
        10
        ·
        edit-2
        2 days ago

        From the post title, description, and other peoples comments, I took away that the meme is m9re about suspecting your ex didnt even write their own breakup message based off the use of em dashes.

        Its a cute surface level joke but it touches in a real nerve because Its becoming more and more common for you to be falsely accused of being an LLM and being told to "ignore all previous instructions and (some stupid instruction) based off small writing quirks like using em or markdown and top comments share this frustration too.

        I shouldn’t have to feel self conscious about the way I write

        Just to pass armchair llm detector wannabe vibe checks 🖕. 
        
        • moakley@lemmy.world
          link
          fedilink
          English
          arrow-up
          2
          ·
          2 days ago

          I got told to “ignore previous instructions” because I said I liked looking at a painting. I think that’s just going to be an insult now.

        • zqps@sh.itjust.works
          link
          fedilink
          arrow-up
          1
          ·
          edit-2
          2 days ago

          If it’s a longer post it’s usually clear that it’s written by a human even with all of these superficial indicators.

        • mholiv@lemmy.world
          link
          fedilink
          arrow-up
          3
          arrow-down
          3
          ·
          edit-2
          2 days ago

          I mean most people are going to use their phones to write messages and given you can’t physically type an em dash it would be normal to be suspicious if you see one.

          Edit: turns out you can physically type them. Still, given that it’s not normal to use them it’s a sign in my book.

            • mholiv@lemmy.world
              link
              fedilink
              arrow-up
              5
              arrow-down
              3
              ·
              edit-2
              2 days ago

              Ok. You can physically type them I concede, but normal humans don’t use them. Still a sign.

              I would bet that the amount of non proof writers that uses em dashes goes up just because people see that it’s associated with ai and want to be funny.

                • mholiv@lemmy.world
                  link
                  fedilink
                  arrow-up
                  6
                  arrow-down
                  2
                  ·
                  2 days ago

                  Yah but your user name is “LanguageIsCool” and you talk about the fun levels of various types of punctuation. You are definitely the outlier here. A cool outlier but an outlier none the less.

      • Flickerby@lemmy.zip
        link
        fedilink
        arrow-up
        19
        arrow-down
        3
        ·
        edit-2
        3 days ago

        Yes, it’s nuanced but it breaks up the sentence flow in a different way. Also used for listing things. A comma is more a small pause, like this. A semicolon is used to differentiate two independent related thoughts ; it’s kind of a combination between a period and a comma. A dash can be used for many things - a longer separating of thoughts, listing different points such as this, or just as an intentional emphasis to add a more protracted pause.

        • TonyTonyChopper@mander.xyz
          link
          fedilink
          arrow-up
          8
          arrow-down
          2
          ·
          edit-2
          2 days ago

          Your comment has a hyphen not an em dash. The point of the post is that AI likes to include em dashes, which are wildly uncommon in modern text, as most keyboards don’t have a key for it

          • hibsen@lemmy.world
            link
            fedilink
            arrow-up
            1
            arrow-down
            1
            ·
            2 days ago

            They’re pretty trivial to make in any OS — having a dedicated key isn’t necessary.

              • hibsen@lemmy.world
                link
                fedilink
                arrow-up
                1
                ·
                2 days ago

                Odd, I see them used all the time, and I’m neither. So I guess either my experience is an outlier, everyone I talk to is secretly an LLM, or maybe the meme is pushing an easy conclusion because people in general are bad at picking up on LLM responses and want an easy punctuation mark so they don’t have to think.

                • isolatedscotch@discuss.tchncs.de
                  link
                  fedilink
                  arrow-up
                  2
                  ·
                  2 days ago

                  I see them used all the time

                  Weird, i hardly ever see a normal hyphen, let alone an em dash, but of course it’s not a foolproof method to detect ai, just a strong indicator

              • hibsen@lemmy.world
                link
                fedilink
                arrow-up
                1
                ·
                2 days ago

                I think this is something macOS does best — using shift+option hyphen is a bit quicker than alt+0151.

                • exasperation@lemmy.dbzer0.com
                  link
                  fedilink
                  arrow-up
                  3
                  ·
                  2 days ago

                  Long pressing the hyphen on the Google keyboard on Android also gives the option of selecting an en dash or em dash.

                  On Linux, if you have the compose key enabled, Compose key + three hyphens in a row will generate an em dash (en dash is two hyphens).

        • Dunstabzugshaubitze@feddit.org
          link
          fedilink
          arrow-up
          9
          arrow-down
          2
          ·
          3 days ago

          aah, so it’s more a choice about the intended sound or flow of a text and not necessary a difference in grammatic constructs, simmiliar to using an oxford comma, or not.

          interesting, i’ll have to pay some attention to that, when reading.

          • Flickerby@lemmy.zip
            link
            fedilink
            arrow-up
            13
            arrow-down
            2
            ·
            3 days ago

            It gives rhythm and flavor to your writing. Varied punctuation - where appropriate - is an easy way to spice anything up, give it a little more flavor, more control over how your words are read. There is a quote by Gary Provost that isn’t specifically about punctuation but illustrates this point well, it’s one of my favorites:

            This sentence has five words. Here are five more words. Five-word sentences are fine. But several together become monotonous. Listen to what is happening. The writing is getting boring. The sound of it drones. It’s like a stuck record. The ear demands some variety. Now listen. I vary the sentence length, and I create music. Music. The writing sings. It has a pleasant rhythm, a lilt, a harmony. I use short sentences. And I use sentences of medium length. And sometimes, when I am certain the reader is rested, I will engage him with a sentence of considerable length, a sentence that burns with energy and builds with all the impetus of a crescendo, the roll of the drums, the crash of the cymbals–sounds that say listen to this, it is important

          • exasperation@lemmy.dbzer0.com
            link
            fedilink
            arrow-up
            1
            ·
            2 days ago

            Em dashes often replace parentheses:

            The company has a policy of having any newly hired employee (like Steve) introduce themselves at the monthly all hands meeting.

            The company has a policy of having any newly hired employee—like Steve—introduce themselves at the monthly all hands meeting.

            Em dashes also often replace colons:

            I’m going to bring my signature dish: bacon-wrapped dates.

            I’m going to bring my signature dish—bacon-wrapped dates.

            Em dashes are commonly used to denote interrupted speech:

            He started to explain, “I was hungry and you weren’t home yet so I—”

            “You’re not a diabetic, you can handle waiting a few extra minutes to eat”

            Replacing commas is unusual and probably incorrect according to most style guides.

            This is also highly localized. Style guides tend to apply only to one particular country, not all English-speaking countries. The AP guide is used by most American newspapers and magazines, and the Chicago Guide is used by most American book publishers. Each have their own rules on dashes.

  • flambonkscious@sh.itjust.works
    link
    fedilink
    English
    arrow-up
    19
    arrow-down
    1
    ·
    2 days ago

    Hasn’t word done this simce about offoce 2007? the autoformat as you type feature, specifically…

    I only know this because I fucking hate it and have been religiously turning it off