INDEX
Explanations
occurrences of the pronoun "it" and variations in its usage
New Auto-Interp
Negative Logits
kers
-0.15
ersen
-0.14
æĽ
-0.14
ÙĨØ´
-0.14
undry
-0.14
dit
-0.13
iny
-0.13
iatrics
-0.13
depart
-0.13
usal
-0.13
POSITIVE LOGITS
soon
0.25
weren
0.25
wasn
0.24
soon
0.23
quickly
0.19
therefore
0.19
shouldn
0.19
Soon
0.19
Soon
0.18
quick
0.18
Activations Density 0.098%