INDEX
Explanations
words related to strong language or intensity
instances of the word "rhetoric."
New Auto-Interp
Negative Logits
ITNESS
-0.68
TAIN
-0.63
IER
-0.62
essa
-0.60
atch
-0.60
ECK
-0.60
atar
-0.60
emale
-0.59
cot
-0.58
cop
-0.58
POSITIVE LOGITS
rhetoric
0.95
flare
0.79
mith
0.76
emanating
0.76
flared
0.73
surrounding
0.73
tir
0.71
denouncing
0.71
spew
0.71
flares
0.70
Activations Density 0.026%