INDEX
Explanations
instances of the word "or" in various contexts
New Auto-Interp
Negative Logits
elli
-0.16
937
-0.15
ä¸Ķ
-0.15
enticate
-0.15
eenth
-0.15
enstein
-0.15
971
-0.14
redient
-0.14
973
-0.14
asmine
-0.14
POSITIVE LOGITS
few
0.19
longer
0.19
more
0.18
few
0.18
so
0.18
cas
0.18
less
0.17
sooner
0.16
iddi
0.15
two
0.15
Activations Density 0.021%