INDEX
    Explanations

    names of specific individuals

    proper nouns, particularly names of individuals

    NEGATIVE LOGITS
    imately    -0.83
    ashtra     -0.74
    vous       -0.65
    ï¸ı        -0.64
    yip        -0.63
     broom     -0.63
    BLIC       -0.62
    minecraft  -0.61
     Haram     -0.59
     LEDs      -0.59
    POSITIVE LOGITS
    asley      0.67
    Marsh      0.64
    ukes       0.64
    canon      0.63
     Kelley    0.63
    vine       0.60
    mberg      0.59
     Cullen    0.57
    oyer       0.57
     Johnston  0.57
    Activation Density: 0.088%

    No Known Activations