INDEX
Explanations
instances of the word "any" and its variations across different contexts
New Auto-Interp
Negative Logits
Some
-0.16
some
-0.16
Various
-0.16
quelques
-0.16
ä¸ĢäºĽ
-0.15
rick
-0.15
SOME
-0.15
Multiple
-0.15
einige
-0.14
eview
-0.14
POSITIVE LOGITS
/all
0.35
THING
0.30
ones
0.30
place
0.29
sort
0.28
kind
0.27
kind
0.25
one
0.24
thin
0.23
ONE
0.22
Activations Density 0.090%