INDEX
    Explanations
    Negative Logits
    Token         Logit
    " dint"       -0.07
    " нуж"        -0.07
    " Scaffold"   -0.06
    "167"         -0.06
    " सर"         -0.06
    " hvě"        -0.06
    " hinted"     -0.06
    " Sundays"    -0.06
    " kraj"       -0.06
    " vej"        -0.06
    Positive Logits
    Token         Logit
    " Com"        0.12
    "com"         0.11
    " com"        0.11
    "Com"         0.11
    "COM"         0.10
    "/com"        0.10
    "-com"        0.10
    "(com"        0.09
    "om"          0.09
    "OM"          0.09
    Activation Density: 0.040%

    No Known Activations