INDEX
Explanations
questions or statements related to particular individuals discussing various topics
interrogative sentences or questions
New Auto-Interp
Negative Logits
metics
-0.70
cipled
-0.67
cartel
-0.65
acan
-0.65
uno
-0.64
indicted
-0.59
lifes
-0.59
run
-0.58
lance
-0.58
conom
-0.57
POSITIVE LOGITS
Answer
1.27
YES
0.84
Yes
0.84
Interview
0.83
RH
0.83
VB
0.78
Answer
0.77
Yeah
0.77
yes
0.74
Question
0.74
Activations Density 0.203%