INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     Hancock
    -0.10
    awk
    -0.09
    anga
    -0.09
     McInt
    -0.09
    ADM
    -0.09
    ãģ£
    -0.09
    asn
    -0.09
     aroused
    -0.08
    ze
    -0.08
     unm
    -0.08
    POSITIVE LOGITS
     appeal
    0.47
     Appeal
    0.39
     appeals
    0.35
     appealed
    0.29
     Appeals
    0.28
    appe
    0.28
    Appe
    0.27
     appealing
    0.26
     reson
    0.25
     appe
    0.19
    Act Density 0.173%

    No Known Activations