INDEX
Explanations
phrases describing significant changes or transformations
occurrences of the word "changed" in various contexts
Negative Logits
Token         Logit
mination      -0.70
alty          -0.68
stra          -0.67
hold          -0.67
Vs            -0.64
beit          -0.63
ictionary     -0.62
verse         -0.61
ç«            -0.61
otropic       -0.61
Positive Logits
Token         Logit
dramatically  0.88
drastically   0.83
ĸļ            0.83
amorph        0.80
psychiat      0.74
elvet         0.72
ilver         0.70
perceptions   0.69
Consent       0.69
abruptly      0.69
Activations Density: 0.047%