INDEX
Negative Logits
Byzantine
-0.08
Legion
-0.08
Gil
-0.08
Weber
-0.08
Gilbert
-0.08
optimaal
-0.08
Delphi
-0.08
tní
-0.08
Dennis
-0.07
Jerome
-0.07
POSITIVE LOGITS
sür
0.08
irectional
0.08
Such
0.08
ทุก
0.08
Excluded
0.07
configurable
0.07
Across
0.07
Water
0.07
Skipping
0.07
engers
0.07
Activations Density 0.002%