INDEX
Explanations
instances of the verb "have" and its variants indicating possession or experience
New Auto-Interp
Negative Logits
ึà¹Ī
-0.18
azar
-0.16
136
-0.15
stát
-0.15
ugen
-0.15
aat
-0.14
animated
-0.14
rodi
-0.14
ARSE
-0.13
athers
-0.13
POSITIVE LOGITS
such
0.21
nt
0.20
it
0.20
a
0.19
finally
0.19
to
0.18
this
0.18
tons
0.18
no
0.18
zero
0.18
Activations Density 0.310%