INDEX
Explanations
instances of the word "among" and its variations, indicating a focus on group or collective contexts
New Auto-Interp
Negative Logits
ising
-0.17
aries
-0.17
itz
-0.16
eri
-0.15
burg
-0.15
orsch
-0.14
ential
-0.14
eros
-0.14
osit
-0.13
oca
-0.13
POSITIVE LOGITS
st
0.36
those
0.21
Equals
0.20
equals
0.20
Ñģобой
0.20
sted
0.19
est
0.19
s
0.19
пÑĢоÑĩ
0.18
themselves
0.18
Activations Density 0.033%