INDEX
Negative Logits
osi
-0.77
iversal
-0.75
terness
-0.73
Seym
-0.72
odies
-0.71
ibaba
-0.70
vez
-0.70
ccording
-0.68
ione
-0.63
yip
-0.63
POSITIVE LOGITS
åij
0.69
or
0.67
alone
0.63
istg
0.63
ford
0.62
overnight
0.61
bare
0.61
EngineDebug
0.60
,
0.60
iHUD
0.58
Activations Density 0.020%