INDEX
Explanations
the personal pronoun 'you' as the target
references to the second-person perspective and personal experiences
Negative Logits
Token            Logit
ipal             -0.68
Agriculture      -0.64
Sabha            -0.61
ice              -0.60
Commerce         -0.59
Reconstruction   -0.59
apo              -0.57
Course           -0.57
aughed           -0.57
ĸļ               -0.57
Positive Logits
Token            Logit
tub              1.34
guys             1.20
're              1.17
RS               1.14
Tube             0.90
've              0.83
hei              0.82
'll              0.81
NG               0.81
ldon             0.76
Activations Density 0.115%
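
A density figure like the one above can be read as the share of sampled tokens on which the feature fires. The sketch below assumes that definition, with any activation above zero counting as firing; the dashboard itself does not state how the value is computed. The function name `activation_density` and the sample data are hypothetical and chosen only to reproduce a value of the same size.

```python
from typing import Sequence


def activation_density(activations: Sequence[float], threshold: float = 0.0) -> float:
    """Return the percentage of tokens whose activation exceeds `threshold`.

    Assumption: "density" means the fraction of tokens with a nonzero
    activation, expressed as a percentage.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    fired = sum(1 for a in activations if a > threshold)
    return 100.0 * fired / len(activations)


# Hypothetical sample: the feature fires on 23 of 20,000 tokens.
sample = [0.0] * 19_977 + [1.5] * 23
print(f"{activation_density(sample):.3f}%")  # prints "0.115%"
```

At this density the feature is active on roughly one token in 870, which fits a feature tied to a narrow pattern such as second-person address.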