INDEX
Explanations
words indicating quantity or abundance
New Auto-Interp
Negative Logits
icken
-0.16
base
-0.14
certain
-0.14
consistent
-0.14
w
-0.14
oad
-0.14
Certain
-0.14
istrovstvÃŃ
-0.13
g
-0.13
odes
-0.13
POSITIVE LOGITS
times
0.20
sclerosis
0.19
-times
0.17
ishi
0.17
vezes
0.17
ãĢħ
0.17
different
0.15
-many
0.15
ligne
0.14
times
0.14
Activations Density 0.020%