INDEX
    Explanations

    names of people, particularly those with last names commonly associated with color (e.g., White, Black, Green)

    the last names of notable individuals

    Negative Logits
     VIDE           -0.68
     srfAttach      -0.66
     polarization   -0.62
     psi            -0.61
     Haram          -0.60
     Serie          -0.59
     BOX            -0.59
     Hydra          -0.58
     Pastebin       -0.58
     masked         -0.58

    Positive Logits
    stein            1.36
    berger           1.34
    croft            1.34
    gren             1.32
    cott             1.32
    hill             1.30
    worth            1.29
    baum             1.29
    field            1.28
    burn             1.27

    Act Density 0.154%

    No Known Activations