INDEX
Explanations
pronouns followed by a possessive pronoun and a verb, indicating actions related to possession and relationships
references to specific individuals and personal pronouns
Negative Logits
ãĥIJ        -0.81
ĸļ         -0.79
ieri        -0.76
hesda       -0.69
idelines    -0.69
iquette     -0.64
izzard      -0.63
inance      -0.62
omial       -0.62
antle       -0.61
Positive Logits
nightmares  0.91
senses      0.88
classmates  0.80
imag        0.79
imagining   0.78
guess       0.78
doubts      0.78
witnessed   0.77
awoke       0.76
watched     0.75
Activations Density 0.837%