INDEX
Explanations
phrases related to possessiveness or ownership
references to one's personal perspective or ownership of opinions and experiences
New Auto-Interp
Negative Logits
olic
-0.74
osate
-0.73
iso
-0.72
obar
-0.71
wark
-0.69
ibaba
-0.68
sylvania
-0.68
Dates
-0.68
ccording
-0.67
ENDED
-0.67
POSITIVE LOGITS
backyard
0.96
personal
0.93
creations
0.92
accord
0.91
selves
0.86
mortality
0.84
pockets
0.84
biases
0.83
demise
0.82
merits
0.81
Activations Density 0.037%