INDEX
    Explanations

    names of specific individuals

    Negative Logits
    library     -0.79
    Madison     -0.73
     Dagger     -0.72
     Cock       -0.67
    EDIT        -0.66
     Blink      -0.64
     Cats       -0.64
     Fiction    -0.63
     Shiite     -0.62
    rising      -0.62
    Positive Logits
     Mata       0.89
    unia        0.87
    fer         0.82
    iversal     0.82
    ía          0.80
    acho        0.79
     Gaal       0.77
    orr         0.77
    ijn         0.76
    ilies       0.74
    Act Density 0.017%

    No Known Activations