INDEX
Explanations
instances of the word "each" and related terms indicating distribution or individual items in a set
Negative Logits
amen         -0.17
igan         -0.15
iverse       -0.15
iw           -0.15
less         -0.14
eigen        -0.13
possessions  -0.13
resources    -0.13
ric          -0.13
Meh          -0.13
Positive Logits
each         0.18
respective   0.18
(each        0.16
each         0.16
각           0.15
ンパ         0.15
ergy         0.15
ebek         0.15
Each         0.14
каждого      0.14
Activations Density 0.058%