Explanations
personal pronouns followed by verbs indicating action
pronouns and personal references
Negative Logits
srfAttach       -0.82
Elsewhere       -0.79
Others          -0.79
Others          -0.75
Amen            -0.75
Else            -0.73
soType          -0.72
Another         -0.71
Additional      -0.71
Lastly          -0.69

Positive Logits
literally       0.91
've             0.82
asma            0.80
WANT            0.78
invented        0.76
want            0.76
basically       0.75
simply          0.73
genuinely       0.73
relentlessly    0.72
Activations Density 0.409%