INDEX
Explanations
mentions of things being heavily targeted or heavily emphasized
words related to overwhelming or excessive experiences
New Auto-Interp
Negative Logits
slave
-0.73
arel
-0.71
lim
-0.66
olen
-0.65
Ct
-0.65
chal
-0.63
anse
-0.63
ource
-0.62
asury
-0.62
emetery
-0.62
POSITIVE LOGITS
ously
0.69
DERR
0.67
onite
0.66
ĪĴ
0.66
paste
0.65
ruciating
0.65
bombard
0.65
ï¸
0.64
ishly
0.64
Blumenthal
0.61
Activations Density 0.126%