INDEX
    Explanations

    names of people and potentially organizations

    Negative Logits
    " Carnage"         -0.67
    " サーティワン"    -0.66
    " lesbians"        -0.65
    " Masquerade"      -0.63
    " Somali"          -0.63
    " Sigma"           -0.62
    " [&"              -0.62
    "cknowled"         -0.62
    " Skydragon"       -0.62
    "initialized"      -0.62
    Positive Logits
    "odore"            0.75
    "dinand"           0.75
    "romeda"           0.69
    "alin"             0.69
    "ison"             0.68
    "entin"            0.67
    "berman"           0.67
    "rick"             0.67
    "undy"             0.65
    "espie"            0.65
    Activation Density: 0.217%

    No Known Activations