INDEX
    Explanations

    expressions of personal feelings and thoughts

    New Auto-Interp
    Negative Logits
    itech
    -0.17
    ongan
    -0.16
    itto
    -0.15
     odv
    -0.15
    วà¸Ļ
    -0.15
    å¯
    -0.14
    fu
    -0.14
    ALSE
    -0.14
    ilight
    -0.14
    avid
    -0.14
    POSITIVE LOGITS
    838
    0.18
    836
    0.17
    Blo
    0.16
    zza
    0.16
    665
    0.15
     most
    0.15
    863
    0.15
    _PIPE
    0.15
    ableView
    0.15
    616
    0.15
    Act Density 0.054%

    No Known Activations