INDEX
Explanations
the pronoun "I" followed by verbs or hypothetical situations
Negative Logits
Token         Logit
Apart         -0.66
MpServer      -0.66
Reviewer      -0.60
irlf          -0.58
Balt          -0.57
*/(           -0.56
Alternative   -0.56
Volunteers    -0.56
srfAttach     -0.56
emonic        -0.54
Positive Logits
Token         Logit
'm            1.23
verson        0.89
hadn          0.84
am            0.80
myself        0.80
ever          0.79
ggy           0.79
xtap          0.76
liam          0.76
wasn          0.75
Activations Density: 0.061%