INDEX
Negative Logits
under
-0.91
Under
-0.76
wane
-0.72
>*
-0.71
шти
-0.70
drove
-0.68
唯一的
-0.68
labored
-0.67
beobachten
-0.66
oblig
-0.65
POSITIVE LOGITS
мем
0.73
ulously
0.71
prefix
0.70
Pineda
0.66
ComponentName
0.66
ḍ
0.66
acão
0.65
FACT
0.64
prin
0.63
lidt
0.63
Activations Density 0.052%