INDEX
Explanations
the neuron activates on words that describe spatial orientation and positioning (e.g. facing, behind, front, back, sideways, angles).
explicit sexual content and descriptions related to intimate situations.
New Auto-Interp
Negative Logits
effort
-0.07
les
-0.07
vera
-0.07
Gio
-0.06
вар
-0.06
Breath
-0.06
yeme
-0.06
_mirror
-0.06
thesized
-0.06
textu
-0.06
POSITIVE LOGITS
face
0.07
Francie
0.06
-facing
0.06
Ames
0.06
};↵
0.06
forControlEvents
0.06
๕
0.06
-Nov
0.06
facing
0.06
seek
0.06
Activations Density 0.012%