INDEX
Explanations
personal pronouns or pronouns referring to people
pronouns referring to subjects and their actions
New Auto-Interp
Negative Logits
odor
-0.63
Seah
-0.63
Oral
-0.63
advertisement
-0.61
Houses
-0.61
Tomorrow
-0.60
verting
-0.60
Amen
-0.59
Yesterday
-0.58
Balls
-0.58
POSITIVE LOGITS
'll
0.96
've
0.90
'd
0.86
forg
0.77
learnt
0.76
streng
0.75
wont
0.75
sych
0.74
lder
0.74
izen
0.73
Activations Density 0.221%