INDEX
Explanations
questions asking for opinions or thoughts
the phrase "What do you think," indicating a focus on soliciting opinions or thoughts
New Auto-Interp
Negative Logits
Kat
-0.68
arc
-0.68
cases
-0.67
Alexandria
-0.67
apo
-0.66
icy
-0.65
Carroll
-0.65
planes
-0.64
photos
-0.64
Ange
-0.64
POSITIVE LOGITS
guys
1.31
're
0.97
've
0.89
tub
0.87
contrace
0.87
know
0.85
sugg
0.82
think
0.78
intend
0.78
fare
0.77
Activations Density 0.049%