INDEX
Explanations
occurrences of the word "have" and its variations
New Auto-Interp
Negative Logits
زÛĮ
-0.16
eniable
-0.15
pis
-0.15
ali
-0.14
artner
-0.14
ando
-0.14
ikel
-0.14
pio
-0.14
ISTA
-0.14
liest
-0.13
POSITIVE LOGITS
to
0.40
besoin
0.22
να
0.21
'gc
0.17
to
0.17
’ta
0.16
Äijá»ĥ
0.16
warts
0.16
contempt
0.16
needs
0.15
Activations Density 0.068%