INDEX
Explanations
The neuron activates on occurrences of the word “read” or complaints about being able to read text (i.e. readability concerns).
New Auto-Interp
Negative Logits
验证
-0.07
.logo
-0.07
frase
-0.06
Disc
-0.06
progressbar
-0.06
.blog
-0.06
jours
-0.06
pants
-0.06
FA
-0.06
PackageManager
-0.06
POSITIVE LOGITS
SignUp
0.07
topLevel
0.06
neğin
0.06
二二
0.06
cripcion
0.06
pthread
0.06
leftright
0.06
Drivers
0.06
atherine
0.06
zh
0.06
Activations Density 0.021%