INDEX
    Explanations

    references to the speaker or writer's personal experiences and opinions

    New Auto-Interp
    Negative Logits
    ught
    -0.17
    abouts
    -0.16
    burgh
    -0.16
    æľ«
    -0.16
    .fm
    -0.16
    avenport
    -0.16
    eno
    -0.14
    UGHT
    -0.14
    å´
    -0.14
    μή
    -0.14
    POSITIVE LOGITS
    DEX
    0.16
    sth
    0.15
    ronic
    0.15
     Spy
    0.15
    edere
    0.14
    968
    0.14
    ryn
    0.14
     consent
    0.14
     mage
    0.14
    elts
    0.14
    Act Density 0.182%

    No Known Activations