Explanations
personal possessive pronouns, "yours" and "ours"
Negative Logits
Token            Logit
Beg              -0.69
IPM              -0.66
hips             -0.64
edd              -0.64
orsi             -0.63
itement          -0.62
atever           -0.62
establishment    -0.62
ypes             -0.61
iencies          -0.60
Positive Logits
Token            Logit
selves           1.12
self             1.07
craft            0.87
field            0.76
fields           0.74
creen            0.72
opia             0.72
!/               0.72
cart             0.71
ovie             0.69
Activations Density: 0.016%