INDEX
Explanations
consistent expressions of frequency and permanence
New Auto-Interp
Negative Logits
Cider
-0.43
rencontres
-0.40
organization
-0.40
ly
-0.39
ऽ
-0.36
organisation
-0.36
fören
-0.36
junit
-0.35
merve
-0.35
artige
-0.35
POSITIVE LOGITS
Always
1.18
Always
1.15
always
1.14
ALWAYS
1.08
ALWAYS
1.04
always
1.03
alway
0.91
Siempre
0.90
siempre
0.89
Siempre
0.84
Activations Density 0.075%