INDEX
Negative Logits
mson
-0.76
ngth
-0.74
hower
-0.72
perse
-0.68
RNA
-0.63
åŃ
-0.63
æĸ¹
-0.61
DragonMagazine
-0.61
selves
-0.61
REDACTED
-0.60
POSITIVE LOGITS
jamin
1.56
nington
1.08
oit
1.03
cher
0.93
ghazi
0.93
chers
0.92
ches
0.87
itude
0.84
ning
0.84
imaru
0.83
Activations Density 0.537%