INDEX
    Explanations

    names and specific identifying terms

    New Auto-Interp
    Negative Logits
    icult
    -0.76
    yrinth
    -0.74
    aunders
    -0.74
    EMS
    -0.73
    isexual
    -0.73
    romy
    -0.71
    istar
    -0.69
    iasm
    -0.68
    psey
    -0.66
    nuts
    -0.65
    POSITIVE LOGITS
    plates
    1.49
    plate
    1.34
    paces
    1.15
    ames
    0.95
    paced
    0.92
    names
    0.89
     tag
    0.87
     recognition
    0.87
     aliases
    0.86
    akes
    0.84
    Act Density 0.614%

    No Known Activations