INDEX
Explanations
personal actions or experiences related to statements
first-person pronouns and associated phrases indicating personal experiences or actions
Negative Logits
eers        -0.82
notations   -0.71
fits        -0.65
quartered   -0.64
alist       -0.62
geries      -0.62
cannabin    -0.62
pins        -0.61
orians      -0.61
pes         -0.60
Positive Logits
revis       0.85
embarked    0.85
toured      0.85
visited     0.84
celebrated  0.83
inaug       0.82
encount     0.82
tweeted     0.82
resur       0.80
traveled    0.80
Activations Density: 0.262%