INDEX
    Explanations

    occurrences of the dollar sign in various contexts

    New Auto-Interp
    Negative Logits
    joy
    -0.18
    ess
    -0.16
    elez
    -0.16
    cctor
    -0.16
    AYOUT
    -0.15
    iron
    -0.15
    ekler
    -0.14
    ниÑĨа
    -0.14
     mus
    -0.14
    hs
    -0.14
    POSITIVE LOGITS
    nez
    0.16
    erli
    0.15
    imed
    0.15
    ilha
    0.15
    کر
    0.14
    632
    0.14
     Trem
    0.14
    lique
    0.14
    quila
    0.14
    elves
    0.14
    Act Density 0.011%

    No Known Activations