Explanations
occurrences of the word "have."
Negative Logits
Token       Logit
zano        -0.17
237         -0.17
osoph       -0.15
uto         -0.15
indo        -0.15
ito         -0.15
amin        -0.15
ÙĨاÙħÙĩ     -0.14
amina       -0.14
omentum     -0.14
Positive Logits
Token       Logit
duty        0.15
_DD         0.15
Duty        0.15
obl         0.14
cbc         0.14
lotte       0.14
orgh        0.13
.resp       0.13
ational     0.13
段          0.13
Activations Density: 0.151%