Explanations
politics
The neuron detects mentions of political discourse, especially the words “politics” and “political.”
Negative Logits
каз         -0.08
glyph       -0.07
독          -0.07
кат         -0.06
درخواست     -0.06
캐          -0.06
Sync        -0.06
RAF         -0.06
Coch        -0.06
Jo          -0.06
Positive Logits
_Product    0.06
_public     0.06
.Errors     0.06
üslü        0.06
ัณฑ         0.06
constant    0.06
čím         0.06
flate       0.06
forg        0.06
-lived      0.06
Activations Density 0.009%
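
As a rough illustration of how to read the density figure, here is a minimal sketch. It assumes that "activation density" means the percentage of tokens on which the neuron's activation exceeds a threshold (zero here). The page does not state this definition, and the function name `activation_density` and the sample values are hypothetical. Under that reading, 0.009% is about 9 active tokens per 100,000.

```python
from typing import Sequence


def activation_density(activations: Sequence[float], threshold: float = 0.0) -> float:
    """Return the percentage of tokens whose activation exceeds `threshold`.

    Assumption: density is the share of tokens on which the neuron fires.
    The dashboard itself does not spell out its definition.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return 100.0 * active / len(activations)


# Hypothetical toy data: the neuron fires on 1 of 4 tokens.
sample = [0.0, 0.0, 0.0, 1.5]
print(activation_density(sample))  # 25.0
```

A density this low would suggest the neuron fires on a narrow set of tokens, which fits an explanation tied to specific words such as "politics" and "political."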