INDEX
Explanations
The neuron activates on occurrences of “sheep” (including its “ep” subtoken), i.e. it detects mentions of the animal sheep.
New Auto-Interp
Negative Logits
ิทย
-0.07
廠
-0.07
dáng
-0.07
písem
-0.07
Roland
-0.06
วร
-0.06
X
-0.06
filmpjes
-0.06
['.
-0.06
siêu
-0.06
POSITIVE LOGITS
sheep
0.15
Sheep
0.12
Shepherd
0.10
lamb
0.09
goat
0.09
flock
0.09
shepherd
0.08
Lamb
0.08
091
0.08
Goat
0.07
Activations Density 0.006%