Negative Logits
potion    -0.71
ayne      -0.70
lua       -0.68
eric      -0.65
heed      -0.64
nexus     -0.63
jri       -0.62
ipeg      -0.62
skill     -0.61
yles      -0.61
Positive Logits
outlets   1.18
outlet    1.06
eval      0.79
amplify   0.76
Ignore    0.73
airing    0.72
NPR       0.70
ARTICLE   0.69
Beir      0.68
CNN       0.68
Activation Density: 0.041%