INDEX
Explanations
pronouns and verbs indicating hypothetical scenarios
second-person and first-person pronouns, indicating a focus on direct address or personal connection in discussions
New Auto-Interp
Negative Logits
ces
-0.93
76561
-0.80
opens
-0.76
got
-0.72
edIn
-0.68
mining
-0.68
ooks
-0.66
started
-0.66
ca
-0.66
assemb
-0.64
POSITIVE LOGITS
succeed
1.09
be
1.06
suffice
1.03
continue
0.96
survive
0.95
concede
0.94
abandon
0.94
accept
0.92
migrate
0.92
persist
0.90
Activations Density 0.077%