INDEX
Explanations
personal pronouns and possessive pronouns suggesting ownership
pronouns related to personal or collective identity and experience
New Auto-Interp
Negative Logits
Rig
-0.71
math
-0.69
cussion
-0.67
Cliff
-0.66
hess
-0.64
Remastered
-0.63
Dud
-0.63
Adv
-0.62
Voyager
-0.61
Ju
-0.60
POSITIVE LOGITS
encount
1.11
encountered
1.04
've
0.97
deems
0.94
deem
0.91
learned
0.87
encounter
0.85
learnt
0.84
deemed
0.83
accumulated
0.83
Activations Density 0.140%