Explanations
instances of the verb "have" in various forms
Negative Logits
artner      -0.16
915         -0.15
499         -0.14
IfNeeded    -0.14
278         -0.14
ιδ          -0.13
Into        -0.13
bun         -0.13
닥          -0.13
906         -0.13
Positive Logits
to          0.69
να          0.39
to          0.31
を          0.30
to          0.28
để          0.28
zu          0.26
anz         0.25
чтобы       0.25
」を        0.24
Activations Density 0.152%