INDEX
Negative Logits
tuning
-0.09
Doll
-0.08
ияи
-0.08
हाल
-0.07
savings
-0.07
ования
-0.07
variability
-0.07
ον
-0.07
्यास
-0.07
Dart
-0.07
POSITIVE LOGITS
seventy
0.08
-tests
0.07
tests
0.07
Started
0.07
chapa
0.07
Prim
0.07
soaps
0.07
hok
0.07
(si
0.07
beendet
0.07
Activations Density 0.001%