INDEX
Negative Logits
_press
-0.08
Driven
-0.07
�
-0.07
Mosk
-0.07
-spacing
-0.07
ulsive
-0.07
cement
-0.07
presses
-0.07
म
-0.07
Spir
-0.07
POSITIVE LOGITS
tasks
0.10
tarefas
0.09
tareas
0.09
(tasks
0.08
tâches
0.08
impossible
0.08
ndares
0.08
.tasks
0.08
claiming
0.08
ngoại
0.08
Activations Density 0.004%