INDEX
Explanations
instances of the word 'said' followed by a descriptive word
instances of the word "said" and its variants
New Auto-Interp
Negative Logits
ntil
-0.74
xtap
-0.70
potion
-0.70
ILCS
-0.67
Ranked
-0.67
EDIT
-0.67
poon
-0.66
plete
-0.66
oufl
-0.65
pleting
-0.65
POSITIVE LOGITS
goodbye
0.92
doms
0.74
earlier
0.73
sarcast
0.72
anecd
0.69
afterward
0.67
sonian
0.65
aloud
0.65
respondent
0.63
afterwards
0.62
Activations Density 0.138%