INDEX
Explanations
phrases related to actions or activities that the individual has been doing
the phrase "I have been" in various contexts
New Auto-Interp
Negative Logits
lies
-0.72
izable
-0.69
rones
-0.68
terday
-0.68
Must
-0.67
ives
-0.66
iop
-0.66
regate
-0.65
Guy
-0.64
idental
-0.64
POSITIVE LOGITS
subjected
1.03
able
1.03
unable
1.03
tasked
0.95
accused
0.93
forgiven
0.91
criticized
0.91
warned
0.89
punished
0.89
rewarded
0.89
Activations Density 0.126%