INDEX
Explanations
proper nouns or names of people, places, or events
instances of the word "became."
New Auto-Interp
Negative Logits
inarily
-0.76
inately
-0.72
oning
-0.71
ramid
-0.69
alian
-0.68
been
-0.67
enger
-0.64
atching
-0.64
orney
-0.63
otropic
-0.62
POSITIVE LOGITS
accustomed
0.89
entangled
0.87
extinct
0.84
embroiled
0.83
oslav
0.82
acquainted
0.79
undone
0.76
disillusion
0.75
nces
0.74
ATURES
0.73
Activations Density 0.046%