INDEX
Explanations
Opinions and arguments
This neuron activates on admonitory second-person conditional phrases, especially “next time you…” constructions.
New Auto-Interp
Negative Logits
setId
-0.07
肯定
-0.07
ываются
-0.07
Bean
-0.07
messagebox
-0.07
FAR
-0.07
insecure
-0.06
ěti
-0.06
Num
-0.06
fanatic
-0.06
POSITIVE LOGITS
paněl
0.07
Millionen
0.07
پای
0.07
.idx
0.06
milano
0.06
珠
0.06
.''↵↵
0.06
thụ
0.06
.amazon
0.06
sammen
0.06
Activations Density 0.049%