INDEX
Explanations
proper nouns related to historic or significant events or people
instances of the verb "became"
New Auto-Interp
Negative Logits
inarily
-0.75
oning
-0.73
enger
-0.71
atching
-0.71
inately
-0.71
aging
-0.68
been
-0.66
otropic
-0.66
Glover
-0.65
mouth
-0.64
POSITIVE LOGITS
accustomed
0.82
entangled
0.81
embroiled
0.77
extinct
0.76
oslav
0.75
victorious
0.75
ATURES
0.75
nces
0.74
ens
0.73
withdrawn
0.72
Activations Density 0.047%