INDEX
Explanations
the word "each" in various contexts
Negative Logits
er        -0.64
SUT       -0.62
im        -0.62
Goy       -0.61
pinos     -0.60
shit      -0.57
Bon       -0.57
fores     -0.56
Wiseman   -0.56
ter       -0.56
POSITIVE LOGITS
EACH      2.30
each      2.23
EACH      2.18
Each      2.12
each      2.11
Each      2.08
Chaque    1.84
Chaque    1.74
chaque    1.66
ciasc     1.59
Activations Density 0.101%