INDEX
    Explanations

    various forms of the word "type."

    New Auto-Interp
    Negative Logits
    vern
    -0.17
    (disposing
    -0.17
    wm
    -0.14
    .thumb
    -0.14
     Angelo
    -0.14
    808
    -0.14
    ãĥ¼ãĥ¬
    -0.14
    erno
    -0.14
    idot
    -0.14
    duce
    -0.14
    POSITIVE LOGITS
    ault
    0.20
    trl
    0.17
     ep
    0.17
    _NR
    0.16
    epy
    0.15
    vyk
    0.15
     Ward
    0.15
    iage
    0.15
    enet
    0.15
    jab
    0.15
    Act Density 0.021%

    No Known Activations