INDEX
    Explanations

    romance novels

    This neuron responds to words and phrases that denote extreme wealth or high social status (e.g. rich, billionaire, tycoon).

    Negative Logits
     faster       -0.07
    ana           -0.07
     LDAP         -0.06
    inning        -0.06
     retrieved    -0.06
     Ben          -0.06
    Bad           -0.06
    Solar         -0.06
     Chris        -0.06
    ugu           -0.06
    Positive Logits
    Navigator     0.07
    ––            0.07
     توص          0.06
    urgence       0.06
    ]").          0.06
    _pcm          0.06
     innoc        0.06
     looph        0.06
    ENTITY        0.06
     traces       0.06
    Activation Density: 0.091%

    No Known Activations