INDEX
    Explanations

    references to specific characters or names

    Negative Logits
    Token            Logit
    " laureate"      -0.63
    "FACE"           -0.62
    "weight"         -0.62
    " poppy"         -0.61
    " Winchester"    -0.60
    "oire"           -0.60
    " Tempest"       -0.59
    "FORMATION"      -0.59
    " dividend"      -0.59
    "ORGE"           -0.58
    Positive Logits
    Token            Logit
    "umar"            1.31
    "unin"            1.28
    "ansas"           1.24
    "ernel"           1.21
    "nown"            1.18
    "rish"            1.17
    "htar"            1.08
    "ileaks"          1.06
    "owski"           1.04
    "arak"            1.03
    Activation Density: 0.672%

    No Known Activations