INDEX
Explanations
phrases indicating personal opinions
phrases expressing personal views or opinions
NEGATIVE LOGITS
llular      -0.69
Himself     -0.64
raltar      -0.64
ULAR        -0.62
Topic       -0.61
perty       -0.60
artney      -0.59
Yourself    -0.59
Weld        -0.59
Ton         -0.57
POSITIVE LOGITS
phas         0.74
unres        0.69
owitz        0.68
sts          0.68
terminating  0.60
inval        0.59
wrongful     0.58
opian        0.57
affirmative  0.57
aesthetic    0.56
Activations Density: 0.038%