INDEX
Explanations
News articles
The neuron activates on evaluative or opinion-bearing words (the kind of adverbs, adjectives, and modals that mark subjective commentary).
New Auto-Interp
Negative Logits
.nombre
-0.07
farewell
-0.07
.Format
-0.06
Kag
-0.06
DONE
-0.06
rè
-0.06
annotations
-0.06
budgets
-0.06
Saudi
-0.06
hani
-0.06
POSITIVE LOGITS
上涨
0.07
ательных
0.06
NAFTA
0.06
meses
0.06
↵↵
0.06
cần
0.06
wished
0.06
})();↵
0.06
всем
0.06
properly
0.06
Activations Density 0.023%