INDEX
Explanations
phrases related to personal possessions and actions
possessive pronouns
New Auto-Interp
Negative Logits
ĸļ
-0.87
drawn
-0.77
nil
-0.74
noon
-0.73
dding
-0.72
taker
-0.71
draw
-0.69
vine
-0.68
put
-0.68
quart
-0.68
POSITIVE LOGITS
entire
1.43
whole
1.01
own
0.88
offending
0.86
selves
0.82
ãĥ³ãĤ¸
0.81
entirety
0.80
respective
0.78
goods
0.78
precious
0.76
Activations Density 0.365%