Explanations
This neuron detects occurrences of the word “back” (often in locational phrases like “back yard,” “back of…”).
Negative Logits
Token          Logit
ebilir         -0.07
ค              -0.07
204            -0.07
א              -0.06
IPC            -0.06
Wolverine      -0.06
GetHashCode    -0.06
iştir          -0.06
Suicide        -0.06
xml            -0.06
Positive Logits
Token          Logit
roof           0.06
front          0.06
carro          0.06
hw             0.06
fron           0.06
Fah            0.06
back           0.06
tip            0.06
leader         0.06
Tail           0.06
Activations Density: 0.034%
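
A density figure of this kind is commonly read as the share of sampled tokens on which the neuron fires at all. The page does not state its definition, so the sketch below assumes that reading; the function name `activation_density`, the zero threshold, and the sample data are all hypothetical and chosen only to illustrate the arithmetic.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds `threshold`.

    Assumption: "density" means the share of tokens with a nonzero activation.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 100,000 token positions, 34 of which activate the neuron.
sample = [0.0] * 99_966 + [1.5] * 34
density = activation_density(sample)
print(f"{density:.3%}")
```

Under this reading, a density of 0.034% means the neuron is active on roughly 34 out of every 100,000 tokens, which fits a neuron tied to a single word.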