INDEX
Explanations
quotes or statements made by public figures
New Auto-Interp
Negative Logits
ILCS
-0.76
brance
-0.72
harvest
-0.71
sew
-0.71
Located
-0.70
isite
-0.69
LTD
-0.69
MAP
-0.69
geries
-0.68
profits
-0.66
POSITIVE LOGITS
sarcast
1.31
rhet
1.23
remarks
1.21
apologizing
1.16
angrily
1.14
praising
1.13
joking
1.12
Asked
1.10
apologized
1.09
reiterated
1.08
Activations Density 3.348%