NEGATIVE LOGITS
Token           Logit
fixation        -0.72
DragonMagazine  -0.71
Colossus        -0.64
etary           -0.62
Closure         -0.60
spores          -0.60
sincerity       -0.60
bottom          -0.59
eering          -0.59
Daredevil       -0.59
POSITIVE LOGITS
Token           Logit
ahah            1.11
oy              1.05
aha             1.02
mad             1.01
renheit         0.92
awk             0.88
ora             0.87
hh              0.87
jong            0.87
ghan            0.86
Activations Density: 0.021%