INDEX
Explanations
phrases indicating quotations or reported speech
instances of people making statements or declarations
New Auto-Interp
Negative Logits
taboola
-0.75
perate
-0.69
ILCS
-0.69
physical
-0.69
ugal
-0.66
peg
-0.65
mol
-0.64
otive
-0.63
Holy
-0.63
frac
-0.63
POSITIVE LOGITS
bluntly
0.98
sarcast
0.97
aloud
0.84
emphatically
0.81
omin
0.76
unequivocally
0.75
boldly
0.71
è£ıè
0.69
rhet
0.67
arten
0.66
Activations Density 0.167%