INDEX
Explanations
nouns referring to persons or entities
references to authors or personalities mentioned in the text
New Auto-Interp
Negative Logits
Creed
-0.73
Ghosts
-0.64
cy
-0.60
mode
-0.60
jog
-0.60
speed
-0.59
strength
-0.59
reflex
-0.59
turbo
-0.58
sac
-0.58
POSITIVE LOGITS
eller
4.96
elling
2.03
ellar
1.79
ell
1.70
elled
1.59
ells
1.37
ellery
1.32
ellen
1.32
ella
1.26
eele
1.24
Activations Density 0.006%