INDEX
Explanations
the word "or" used in various contexts
Negative Logits
čil        -0.16
ilor       -0.16
505        -0.15
iled       -0.15
ils        -0.15
ュー       -0.14
ゆ         -0.14
imized     -0.14
ilt        -0.14
notated    -0.14
Positive Logits
two        0.31
two        0.24
couple     0.23
-two       0.21
两个       0.20
deux       0.20
zwei       0.20
few        0.20
两         0.19
兩         0.19
Activations Density 0.022%