INDEX
Explanations
properties, features
This neuron activates on adjectives and adverbs that describe performance metrics or favorable qualities (e.g. wide, excellent, fast, high, low, broad, superior).
New Auto-Interp
Negative Logits
وسی
-0.07
XC
-0.07
phon
-0.07
maintained
-0.07
Сем
-0.06
_support
-0.06
mh
-0.06
Regex
-0.06
اری
-0.06
!:
-0.06
POSITIVE LOGITS
ersten
0.06
.Navigator
0.06
Brightness
0.06
彼女
0.06
später
0.06
.atomic
0.06
后
0.06
cube
0.06
společně
0.06
.He
0.06
Activations Density 0.292%