INDEX
Explanations
the word "only" and its variations
New Auto-Interp
Negative Logits
lez
-0.17
èĩ³å°ij
-0.16
atleast
-0.16
irty
-0.15
least
-0.15
lej
-0.15
ajo
-0.14
rame
-0.14
anka
-0.14
Nin
-0.14
POSITIVE LOGITS
handful
0.25
few
0.21
partial
0.20
partially
0.19
one
0.18
recently
0.17
Partial
0.17
two
0.17
limited
0.16
fraction
0.16
Activations Density 0.078%