INDEX
Explanations
personal pronouns and verbs indicating personal actions
references to personal experiences and relationships
New Auto-Interp
Negative Logits
atory
-0.61
Millennium
-0.60
Gad
-0.60
Innocent
-0.59
Wald
-0.58
endorsement
-0.55
Globe
-0.55
Aston
-0.55
Emerson
-0.54
Amen
-0.53
POSITIVE LOGITS
'll
1.29
've
1.25
're
1.22
'd
1.10
dunno
1.05
'm
0.97
haven
0.96
can
0.96
forgot
0.93
don
0.92
Activations Density 0.544%