INDEX
Explanations
references to various individuals' names
instances of the word "om."
New Auto-Interp
Negative Logits
LIST
-0.69
++++++++++++++++
-0.66
convict
-0.62
nexus
-0.61
separatist
-0.59
fact
-0.59
REDACTED
-0.59
Kindle
-0.58
Brees
-0.57
Wad
-0.57
POSITIVE LOGITS
puter
1.14
orrow
1.14
obile
1.13
otive
1.07
atoes
1.05
useum
1.02
obiles
0.98
otor
0.96
pson
0.96
ikawa
0.96
Activations Density 0.013%