Explanations

Stable Diffusion, neural networks, gender identity

The neuron selectively activates on capitalized tokens: proper nouns, acronyms, and other named or technical terms.

Configuration

Prompts (Dashboard)

392,802 prompts, 256 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Negative Logits

"и"  0.67
" 없고"  0.65
" और"  0.63
" και"  0.63
" এবং"  0.59
"و"  0.59
"और"  0.58
" torno"  0.56
"và"  0.55
" અને"  0.55

Positive Logits

" respectively"  0.62
" ஆகியவற்ற"  0.59
" alike"  0.59
" ஆகிய"  0.57
" sebagainya"  0.57
"\,."  0.56
" respectivamente"  0.55
"!)."  0.53
" പ്രത്യേ"  0.53

Activations Density 0.001%
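
To give the density figure some scale, the short sketch below combines it with the prompt count and prompt length listed under Configuration. It assumes, which the page does not state, that the density is measured over exactly those dashboard prompts and that 0.001% is a rounded display value, so the result is an order-of-magnitude estimate only. The function and constant names are illustrative and not part of any dashboard API.

```python
# Figures copied from the dashboard configuration above.
NUM_PROMPTS = 392_802
TOKENS_PER_PROMPT = 256
ACTIVATION_DENSITY_PERCENT = 0.001  # displayed value, presumably rounded


def total_tokens(num_prompts: int, tokens_per_prompt: int) -> int:
    """Total number of token positions across all dashboard prompts."""
    return num_prompts * tokens_per_prompt


def expected_active_tokens(total: int, density_percent: float) -> float:
    """Estimated count of token positions where the feature fires.

    Assumption: density is the fraction of token positions with a
    nonzero activation, expressed as a percentage.
    """
    return total * density_percent / 100.0


total = total_tokens(NUM_PROMPTS, TOKENS_PER_PROMPT)
active = expected_active_tokens(total, ACTIVATION_DENSITY_PERCENT)

print(f"{total:,} token positions in total")
print(f"roughly {active:,.0f} positions with a nonzero activation")
```

Under these assumptions the feature fires on about one token position in every 100,000, which is why the listed explanations rest on a comparatively small set of activating examples.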