INDEX
Explanations
the presence of variations of the verb "have" in different contexts
Negative Logits
istr            -0.14
ynchronously    -0.13
setHidden       -0.13
utura           -0.13
amburger        -0.12
641             -0.12
اÙĪØª           -0.12
Ñģок            -0.12
rewritten       -0.12
((((            -0.12
Positive Logits
questions       0.30
ever            0.25
any             0.24
concerns        0.23
Questions       0.22
children        0.22
doubts          0.21
questions       0.21
kids            0.20
existing        0.20
Activations Density 0.136%