INDEX
Explanations
Direct speech attributions with 'said' followed by a specific person's name
New Auto-Interp
Negative Logits
berus
-0.65
pend
-0.64
\)
-0.60
phia
-0.60
gencies
-0.58
/
-0.58
DragonMagazine
-0.58
mentation
-0.58
Fit
-0.57
Availability
-0.56
POSITIVE LOGITS
sarcast
1.20
rhet
1.02
onstage
1.00
bluntly
0.99
during
0.94
emphatically
0.90
angrily
0.87
forcefully
0.86
afterward
0.85
aloud
0.82
Activations Density 0.109%