INDEX
    Explanations

    instances of the word "ra" and its variations, indicating a focus on repeated syllables or sounds

    Negative Logits
    cta       -0.19
    irma      -0.19
    chia      -0.19
    ä¸ī级     -0.17
    aira      -0.16
    razil     -0.14
    iron      -0.14
    arrant    -0.14
    ιÏĥ       -0.13
    aidu      -0.13
    Positive Logits
    er        0.20
    odon      0.16
    linger    0.16
    onde      0.15
    arth      0.15
    lest      0.15
    eof       0.14
    wert      0.14
    e         0.14
    ZY        0.14
    Act Density 0.033%

    No Known Activations