INDEX
Explanations
quantifiers and intensifiers
New Auto-Interp
Negative Logits
Molto
-0.67
Очень
-0.67
cektir
-0.67
Very
-0.66
يتيمه
-0.66
OMIT
-0.65
InjectAttribute
-0.64
Sehr
-0.64
muito
-0.63
Very
-0.62
POSITIVE LOGITS
autant
0.75
equal
0.75
Tark
0.70
nahilalakip
0.70
ostock
0.66
anny
0.64
ProtoMessage
0.63
Personendaten
0.62
CJK
0.62
olyte
0.61
Activations Density 0.072%