INDEX
Explanations
personal pronouns or verbs indicating action or decision-making
pronouns and expressions of assertion
New Auto-Interp
Negative Logits
Exit
-0.75
rocket
-0.71
culosis
-0.70
grass
-0.69
venge
-0.68
tops
-0.65
pause
-0.65
renheit
-0.63
classic
-0.63
shock
-0.62
POSITIVE LOGITS
stated
0.68
noted
0.65
overest
0.65
hath
0.64
seems
0.64
aforementioned
0.63
perenn
0.63
ather
0.63
conceded
0.63
concedes
0.62
Activations Density 0.404%