INDEX
Explanations
quotations from different individuals
instances of reported speech
New Auto-Interp
Negative Logits
pend
-0.79
fruit
-0.69
pend
-0.67
otin
-0.67
paralle
-0.66
manag
-0.65
thur
-0.64
theless
-0.63
conflic
-0.62
blem
-0.62
POSITIVE LOGITS
sarcast
0.88
bluntly
0.86
rhet
0.83
referring
0.79
afterward
0.72
angrily
0.71
emphatically
0.71
quoting
0.71
aloud
0.69
referencing
0.68
Activations Density 0.129%