INDEX
Explanations
romance novels
This neuron responds to words and phrases that denote extreme wealth or high social status (e.g. rich, billionaire, tycoon).
Negative Logits
Token       Logit
faster      -0.07
ana         -0.07
LDAP        -0.06
inning      -0.06
retrieved   -0.06
Ben         -0.06
Bad         -0.06
Solar       -0.06
Chris       -0.06
ugu         -0.06
Positive Logits
Token       Logit
Navigator   0.07
––          0.07
توص         0.06
urgence     0.06
]").        0.06
_pcm        0.06
innoc       0.06
looph       0.06
ENTITY      0.06
traces      0.06
Activations Density: 0.091%
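
The page does not say how the density figure is computed. One common reading, assumed here, is the fraction of sampled token positions on which the neuron's activation is above zero. The sketch below illustrates that reading only. The function name `activation_density`, the `threshold` parameter, and the sample data are all hypothetical and are not taken from the page.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds threshold.

    Assumption: "density" means the share of sampled tokens on which the
    neuron fires at all; the source page does not define the term.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 10,000 token positions, 9 of which activate.
sample = [0.0] * 9991 + [1.5] * 9
density = activation_density(sample)
print(f"{density:.3%}")
```

Under this reading, a density near 0.09% means the neuron fires on roughly one token in a thousand, which fits a neuron tied to a narrow set of words.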