INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    *,
    -0.07
     стара
    -0.07
     dct
    -0.06
     sofort
    -0.06
    -door
    -0.06
     sẽ
    -0.06
     blobs
    -0.06
     sensory
    -0.06
     öne
    -0.06
     newObj
    -0.06
    POSITIVE LOGITS
     okum
    0.07
    vement
    0.07
     profes
    0.06
    LEMENT
    0.06
    endo
    0.06
    VIOUS
    0.06
                      
    0.06
     znal
    0.06
    ackle
    0.06
     rpm
    0.06
    Act Density 0.000%

    No Known Activations