INDEX
Negative Logits
.",↵         -0.09
."↵          -0.09
yum          -0.09
stove        -0.09
gemakkelijk  -0.08
полз         -0.08
." ↵         -0.08
સરળ          -0.08
quidem       -0.08
wraz         -0.08
Positive Logits
An          0.07
weitere     0.07
Arten       0.07
vehicles    0.07
Degrees     0.07
Vehicles    0.07
Vehicles    0.07
Clearing    0.07
deven       0.07
Conversely  0.07
Activations Density 0.000%