INDEX
Explanations
Groups or sets
The neuron activates on plural nouns (words referring to multiple items, typically ending in “s”).
New Auto-Interp
Negative Logits
icates
-0.07
Bangalore
-0.07
Jay
-0.07
Naturally
-0.06
recognizable
-0.06
StringUtils
-0.06
-object
-0.06
Transformers
-0.06
.Red
-0.06
different
-0.06
POSITIVE LOGITS
RYPT
0.07
_aut
0.07
refl
0.06
ีน
0.06
Hij
0.06
عرض
0.06
ΟΣ
0.06
urn
0.06
좀
0.06
viewType
0.06
Activations Density 0.054%