INDEX
    Explanations

    punctuation marks and their frequency

    Negative Logits
    Token           Logit
    "íg"            -0.14
    "hee"           -0.14
    "ình"           -0.14
    "ška"           -0.14
    "serter"        -0.14
    " mình"         -0.14
    "inger"         -0.14
    "lín"           -0.14
    "_nt"           -0.14
    "_FILENO"       -0.14
    Positive Logits
    Token           Logit
    "COPY"          0.32
    " Because"      0.19
    "ecause"        0.19
    " because"      0.19
    " Despite"      0.19
    " Aside"        0.19
    " despite"      0.18
    " Furthermore"  0.18
    "Furthermore"   0.18
    "Because"       0.17
    Activation Density: 0.005%

    No Known Activations