Explanations
This neuron responds to uses of the pronoun “it.”
Negative Logits
Guild      -0.07
="#"><     -0.07
"How       -0.07
'">        -0.07
Flutter    -0.06
جميع       -0.06
{↵         -0.06
“How       -0.06
특히       -0.06
natal      -0.06
Positive Logits
they       0.08
he         0.07
it         0.07
He         0.07
It         0.07
تف         0.07
asoci      0.07
.They      0.07
It         0.07
.It        0.06
Activations Density 0.089%