INDEX
Negative Logits
rganization
-0.20
riteln
-0.17
iec
-0.16
زد
-0.15
izable
-0.15
tsky
-0.14
Studi
-0.14
idges
-0.14
irtual
-0.14
reen
-0.13
POSITIVE LOGITS
ingles
0.16
orgia
0.16
impression
0.15
lit
0.14
omanip
0.14
ervo
0.14
errupted
0.14
reece
0.14
xaa
0.14
ypo
0.14
Activations Density 0.426%