INDEX
    Explanations

    quantifiers and intensifiers

    New Auto-Interp
    Negative Logits
    Molto
    -0.67
    Очень
    -0.67
    cektir
    -0.67
    Very
    -0.66
     يتيمه
    -0.66
     OMIT
    -0.65
    InjectAttribute
    -0.64
    Sehr
    -0.64
    muito
    -0.63
     Very
    -0.62
    POSITIVE LOGITS
     autant
    0.75
     equal
    0.75
     Tark
    0.70
     nahilalakip
    0.70
    ostock
    0.66
     anny
    0.64
     ProtoMessage
    0.63
    Personendaten
    0.62
    CJK
    0.62
    olyte
    0.61
    Act Density 0.072%

    No Known Activations