INDEX
Explanations
statements involving someone telling information about themselves or others
statements of past actions or experiences involving individuals
New Auto-Interp
Negative Logits
Nationwide
-0.64
Mandatory
-0.63
Translation
-0.62
stakes
-0.61
ormal
-0.59
Header
-0.59
Regions
-0.59
nesses
-0.58
Globe
-0.58
duc
-0.57
POSITIVE LOGITS
regretted
1.27
regrets
1.21
'd
1.05
slept
1.04
hadn
1.02
dreamed
0.98
woke
0.96
wished
0.95
hates
0.94
remembers
0.94
Activations Density 0.185%