INDEX
Explanations
phrases indicating ability or possibility
conditional phrases that express the ability or possibility of action
New Auto-Interp
Negative Logits
senal
-0.76
fired
-0.64
raged
-0.63
Choice
-0.63
Strikes
-0.62
Politics
-0.62
anamo
-0.61
Ez
-0.61
ELD
-0.61
behavi
-0.60
POSITIVE LOGITS
't
1.62
afford
1.45
convince
1.15
muster
1.09
persuade
1.05
manage
1.05
locate
1.02
find
0.97
tolerate
0.94
reproduce
0.93
Activations Density 0.094%