INDEX
    Explanations

    Telling/saying

    New Auto-Interp
    Negative Logits
     Compared
    -0.08
    .the
    -0.08
     Whatever
    -0.08
    _AV
    -0.07
    Compared
    -0.07
     foc
    -0.07
     beautifully
    -0.07
     kleurr
    -0.07
     compared
    -0.07
    .Ab
    -0.07
    POSITIVE LOGITS
     hiatus
    0.09
     आदेश
    0.09
     delimiter
    0.09
     EOF
    0.09
     વિર
    0.09
    停止
    0.08
     cues
    0.08
    暂停
    0.08
     cue
    0.08
     akkoord
    0.08
    Act Density 0.005%

    No Known Activations