INDEX
Explanations
a variety of terms related to quantities, locations and attributes.
the word "further"
Negative Logits
hey           -0.44
acher         -0.42
Hey           -0.41
BeforeMethod  -0.41
miyor         -0.40
zou           -0.39
wikk          -0.39
мен           -0.39
(blank)       -0.38
evitando      -0.38
Positive Logits
further       2.09
further       2.00
Further       1.75
Further       1.70
FURTHER       1.59
FURTHER       1.41
urther        1.37
进一步        1.27
farther       1.11
verder        1.09
Activations Density 3.023%