INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    (proto
    -0.07
     Reverse
    -0.06
    いる
    -0.06
    уют
    -0.06
    /sh
    -0.06
    이스
    -0.06
    料理
    -0.06
     completamente
    -0.06
    .detect
    -0.06
    ioneer
    -0.06
    POSITIVE LOGITS
    	boolean
    0.07
    <bool
    0.06
    among
    0.06
     ),
    0.06
    Aux
    0.06
    ernels
    0.06
    _SC
    0.06
     specializing
    0.06
    grupo
    0.06
     Сим
    0.06
    Act Density 0.050%

    No Known Activations