INDEX
    Explanations

    names of people or entities

    New Auto-Interp
    Negative Logits
    EMS
    -0.79
    yrinth
    -0.72
    romy
    -0.72
    iasm
    -0.71
    icult
    -0.68
    aunders
    -0.68
    iths
    -0.68
    istar
    -0.67
    psey
    -0.65
    nuts
    -0.65
    POSITIVE LOGITS
    plates
    1.44
    plate
    1.29
    paces
    1.07
     redacted
    0.94
    names
    0.94
     tag
    0.94
     tags
    0.93
    ames
    0.91
     names
    0.88
     recognition
    0.88
    Act Density 1.975%

    No Known Activations