INDEX
Explanations
Named entities
This neuron primarily activates on proper nouns—names of people, places, or organizations.
New Auto-Interp
Negative Logits
Judiciary
-0.07
mug
-0.07
जब
-0.06
.inline
-0.06
тради
-0.06
igm
-0.06
restricting
-0.06
fod
-0.06
Ups
-0.06
UP
-0.06
POSITIVE LOGITS
atomic
0.07
+'/
0.07
IllegalAccessException
0.06
-watch
0.06
_sep
0.06
Boris
0.06
"]==
0.06
,strlen
0.06
<!--<
0.06
ヽ
0.06
Activations Density 0.060%