INDEX
Explanations
personal pronouns followed by sentiments or actions
first-person pronouns and expressions of personal sentiment or experience
New Auto-Interp
Negative Logits
imum
-0.69
rift
-0.68
alky
-0.65
igmatic
-0.63
ruct
-0.63
utenberg
-0.62
Economic
-0.61
Street
-0.60
rontal
-0.60
Skip
-0.60
POSITIVE LOGITS
'll
1.21
've
1.09
consequently
1.04
therefore
0.97
'm
0.96
shouldn
0.96
accordingly
0.94
'd
0.92
wondered
0.91
couldn
0.90
Activations Density 0.144%