INDEX
Explanations
instances of the word "while" in various grammatical forms
New Auto-Interp
Negative Logits
/umd
-0.17
ays
-0.16
nik
-0.15
rea
-0.14
_TAC
-0.14
ieren
-0.14
üp
-0.14
ny
-0.14
ÑĨик
-0.14
ior
-0.13
POSITIVE LOGITS
s
0.21
certainly
0.15
there
0.15
there
0.15
may
0.15
may
0.14
acker
0.14
uable
0.14
THERE
0.14
akens
0.14
Activations Density 0.027%