INDEX
    Explanations

    numerical values or identifiers

    New Auto-Interp
    Negative Logits
     Luther
    -0.77
    tis
    -0.76
     Flor
    -0.75
     Ariel
    -0.73
     Venezuel
    -0.70
    ε
    -0.69
     Pengu
    -0.66
    ģĸ
    -0.66
    riel
    -0.65
     Alvin
    -0.64
    POSITIVE LOGITS
    escape
    0.71
    ictive
    0.69
    iculty
    0.69
    hire
    0.67
    acia
    0.67
    erent
    0.66
    cially
    0.65
    ournament
    0.65
    ADA
    0.64
    enced
    0.64
    Act Density 0.000%

    No Known Activations