INDEX
Negative Logits
arian
-0.15
trinsic
-0.15
ire
-0.14
\Context
-0.14
.Safe
-0.14
pk
-0.14
anal
-0.14
adil
-0.14
'gc
-0.14
ogn
-0.14
POSITIVE LOGITS
elen
0.17
erland
0.17
erdale
0.15
avia
0.15
ugal
0.15
ằng
0.15
tat
0.14
Cheer
0.14
Grant
0.14
Tat
0.14
Activations Density 0.069%