INDEX
Explanations
flexible
This neuron activates on occurrences of the adjective “flexible” in technical or patent‐style text.
New Auto-Interp
Negative Logits
,说
-0.07
Spells
-0.06
쳐
-0.06
raid
-0.06
toString
-0.06
eaten
-0.06
236
-0.06
nah
-0.06
Out
-0.06
НА
-0.06
POSITIVE LOGITS
flexible
0.14
Flexible
0.11
Flexible
0.10
flexibility
0.10
_dual
0.09
Flex
0.08
flex
0.08
"<?
0.07
mpg
0.07
Fuse
0.07
Activations Density 0.009%