INDEX
Explanations
references to ownership or possession
New Auto-Interp
Negative Logits
pins
-0.60
ตร์
-0.52
xk
-0.51
phalt
-0.51
admonition
-0.48
Pries
-0.48
and
-0.47
cuts
-0.47
afstand
-0.47
rewind
-0.47
POSITIVE LOGITS
Their
2.09
their
2.03
their
1.94
Their
1.92
THEIR
1.82
themselves
1.75
themselves
1.58
thier
1.51
they
1.45
theirs
1.43
Activations Density 0.092%