INDEX
Explanations
The neuron detects words containing the suffix “ant,” especially chemical‐agent or functional nouns ending in “ant.”
New Auto-Interp
Negative Logits
<Product
-0.07
renders
-0.07
alas
-0.06
scratch
-0.06
rendered
-0.06
unread
-0.06
Another
-0.06
disrespectful
-0.06
?option
-0.06
Lightweight
-0.06
POSITIVE LOGITS
ecom
0.07
inspectors
0.07
Nietzsche
0.07
чих
0.06
�
0.06
(__
0.06
транспор
0.06
воб
0.06
cor
0.06
melody
0.06
Activations Density 0.030%