INDEX
Explanations
words related to changes or transformations
instances of the word "become" to indicate changes or transformations
New Auto-Interp
Negative Logits
inarily
-0.80
oning
-0.73
ength
-0.72
inately
-0.70
erity
-0.69
bell
-0.68
idth
-0.66
aging
-0.65
Cran
-0.64
yip
-0.64
POSITIVE LOGITS
extinct
0.96
obsolete
0.93
entangled
0.93
unman
0.87
accustomed
0.86
embroiled
0.84
clearer
0.83
irrelevant
0.82
synonymous
0.82
indistinguishable
0.82
Activations Density 0.053%