INDEX
Explanations
questions or statements involving someone being asked about a specific topic
instances of the word "asked" indicating inquiries or requests for information
New Auto-Interp
Negative Logits
marine
-0.68
Confeder
-0.63
Right
-0.62
Sham
-0.62
Est
-0.62
rats
-0.61
Cock
-0.61
bows
-0.61
Liber
-0.60
ARM
-0.60
POSITIVE LOGITS
rhet
1.03
asked
0.91
govtrack
0.89
naires
0.84
questions
0.83
ioned
0.82
permission
0.79
probing
0.77
questioned
0.77
repeatedly
0.74
Activations Density 0.018%