Explanations
the word "have" in various forms and contexts
Negative Logits
esco        -0.15
irts        -0.15
_FATAL      -0.14
finder      -0.14
irse        -0.14
ano         -0.14
å°Ķ         -0.14
Když        -0.14
258         -0.14
arin        -0.13
Positive Logits
reason      0.28
options     0.25
choices     0.23
nowhere     0.22
until       0.22
OPTIONS     0.21
bigger      0.21
to          0.19
permission  0.19
Options     0.19
Activations Density: 0.167%