INDEX
Explanations
The neuron detects tokens expressing the concept of waiting (e.g. “待ちます,” “待,” “wait”).
New Auto-Interp
Negative Logits
ίνεται
-0.07
_FIRE
-0.07
POSIX
-0.06
해
-0.06
matching
-0.06
frustrations
-0.06
проблема
-0.06
INDOW
-0.06
todas
-0.06
nước
-0.06
POSITIVE LOGITS
Shortcut
0.07
로
0.06
.↵↵
0.06
ublic
0.06
economic
0.06
nickname
0.06
(trim
0.06
Sections
0.06
#$
0.06
docks
0.06
Activations Density 0.028%