INDEX
Explanations
direct speech attributions
repeated phrases indicating attribution or quotation, specifically the word "said" and its variations
Negative Logits
à¦        -0.78
otin      -0.76
ãĥ¼ãĥ    -0.76
ãĥ¡      -0.71
tumblr    -0.70
\/\/      -0.68
1966      -0.65
Phys      -0.64
à¹        -0.64
ã쮿      -0.64
Positive Logits
hement    0.79
ulty      0.66
anecd     0.66
seq       0.64
mington   0.62
heit      0.61
doms      0.61
ieu       0.60
orney     0.60
($)       0.59
Activations Density 0.269%