INDEX
Explanations
personal pronouns indicating possession or ownership
occurrences of the pronouns "I" and "my"
Negative Logits
Meier       -0.72
olute       -0.68
neapolis    -0.67
rencies     -0.66
nesota      -0.66
kW          -0.65
itude       -0.65
ouston      -0.65
imil        -0.65
itect       -0.63
Positive Logits
bara        0.81
Cats        0.69
âĻ          0.68
RL          0.66
presume     0.66
brig        0.65
ahar        0.65
oan         0.63
udic        0.63
ona         0.63
Activations Density: 0.395%