INDEX
Explanations
scientific, technical
This neuron activates on domain-specific technical nouns and jargon (e.g., specialized multi-syllable content words).
New Auto-Interp
Negative Logits
ConfigurationManager
-0.07
articles
-0.06
article
-0.06
Tokenizer
-0.06
IDENT
-0.06
sizes
-0.06
certification
-0.06
chuyện
-0.06
Matters
-0.06
ysize
-0.06
POSITIVE LOGITS
травня
0.08
áj
0.07
.ImageField
0.07
()=>{↵0.07
liked
0.06
oji
0.06
arac
0.06
allegedly
0.06
GWei
0.06
loan
0.06
Activations Density 0.245%