INDEX
Explanations
This neuron activates on occurrences of the phrase “presented the same way as in the document,” i.e. when checking consistency of entity name presentation.
New Auto-Interp
Negative Logits
apas
-0.08
isure
-0.07
христи
-0.07
melanch
-0.07
-sensitive
-0.06
pil
-0.06
OLER
-0.06
शत
-0.06
Explorer
-0.06
्वत
-0.06
POSITIVE LOGITS
마지막
0.07
mA
0.06
BlockSize
0.06
(group
0.06
confidently
0.06
가정
0.06
렇게
0.06
...)
0.06
.callback
0.06
Ind
0.05
Activations Density 0.001%