INDEX
Explanations
phrases indicating examples or samples of various items or concepts
instances of the word "such" followed by explanations or examples
New Auto-Interp
Negative Logits
ertodd
-0.76
zl
-0.72
olate
-0.71
itudinal
-0.71
ipedia
-0.70
rition
-0.67
oil
-0.67
hene
-0.66
Drum
-0.66
kick
-0.66
POSITIVE LOGITS
ties
0.78
minded
0.68
consequential
0.66
cond
0.66
should
0.62
aggreg
0.60
abundantly
0.60
things
0.60
minded
0.59
constituted
0.59
Activations Density 0.050%