INDEX
Explanations
The neuron activates on common German function words and filler particles, effectively signaling that the text is in German.
New Auto-Interp
Negative Logits
ENTA
-0.07
report
-0.07
couz
-0.07
Director
-0.07
con
-0.07
Keep
-0.07
icons
-0.07
jections
-0.07
_NR
-0.06
Element
-0.06
POSITIVE LOGITS
noch
0.07
'util
0.07
niet
0.07
vlastně
0.07
jedoch
0.06
nur
0.06
-peer
0.06
nicht
0.06
вот
0.06
bitte
0.06
Activations Density 0.050%