INDEX
Explanations
personal pronouns 'I' or 'we' followed by hypothetical situations
first-person singular pronouns in various contexts
New Auto-Interp
Negative Logits
MpServer
-0.66
Reviewer
-0.64
ULTS
-0.60
Balt
-0.58
FAR
-0.58
Neutral
-0.57
Paradise
-0.57
mobi
-0.56
Gleaming
-0.56
Sharp
-0.56
POSITIVE LOGITS
ever
0.85
'm
0.76
succeed
0.75
weren
0.74
wanna
0.74
hypot
0.74
hadn
0.73
agu
0.72
someday
0.72
EVER
0.71
Activations Density 0.073%