INDEX
Explanations
the verb "be" in various contexts and grammatical structures
New Auto-Interp
Negative Logits
being
-0.26
being
-0.25
Being
-0.22
-being
-0.22
Being
-0.21
被
-0.19
ability
-0.19
ABLE
-0.19
Ability
-0.18
èĥ½å¤Ł
-0.18
POSITIVE LOGITS
traced
0.26
easily
0.26
anything
0.25
liken
0.23
safely
0.22
anywhere
0.21
found
0.21
compared
0.21
seen
0.20
anything
0.20
Activations Density 0.125%