INDEX
Explanations
the occurrence of the word "been" in various contexts
New Auto-Interp
Negative Logits
Being
-0.29
being
-0.29
being
-0.28
still
-0.27
Being
-0.27
-being
-0.25
STILL
-0.23
still
-0.23
被
-0.22
Still
-0.20
POSITIVE LOGITS
/is
0.27
lately
0.26
recently
0.24
through
0.23
around
0.23
previously
0.21
since
0.21
Recently
0.19
able
0.19
Recently
0.19
Activations Density 0.130%