INDEX
Negative Logits
_ini
-0.08
anal
-0.08
langs
-0.08
commons
-0.08
row
-0.08
months
-0.08
rows
-0.08
lyn
-0.07
deserves
-0.07
connect
-0.07
POSITIVE LOGITS
excessive
0.14
overly
0.13
exces
0.13
overpower
0.13
overwhelming
0.12
excessively
0.12
demasiado
0.11
overcrow
0.11
demasi
0.11
troppo
0.11
Activations Density 0.038%