INDEX
Explanations
variations of the verb "to be" in different contexts
New Auto-Interp
Negative Logits
,
-0.70
and
-0.59
,?
-0.55
,}
-0.55
e
-0.54
however
-0.53
i
-0.50
<bos>
-0.48
l
-0.48
n
-0.47
POSITIVE LOGITS
been
2.02
been
1.49
got
1.47
Been
1.35
Been
1.32
gotta
1.31
BEEN
1.30
gotten
1.26
become
1.04
ollut
1.03
Activations Density 0.191%