INDEX
Explanations
The neuron detects mentions of attaining fame or popularity (e.g. “gained … fame,” “raggiunto la popolarità”).
New Auto-Interp
Negative Logits
tumors
-0.06
Invocation
-0.06
gangbang
-0.06
başında
-0.06
comps
-0.06
يرا
-0.06
.trailingAnchor
-0.06
ibox
-0.06
_pose
-0.06
پرونده
-0.06
POSITIVE LOGITS
Networks
0.07
eth
0.07
/close
0.06
فيها
0.06
immel
0.06
证
0.06
Ob
0.06
enheim
0.06
(propertyName
0.06
Василь
0.06
Activations Density 0.035%