INDEX
    Explanations

    phrases indicating time-related actions and states

    New Auto-Interp
    Negative Logits
    uet
    -0.16
    idden
    -0.15
     Atlantis
    -0.15
    ensis
    -0.15
    iid
    -0.15
    .Requires
    -0.14
    нож
    -0.14
    åīij
    -0.14
    azÄĥ
    -0.14
    enga
    -0.14
    POSITIVE LOGITS
    .jquery
    0.16
     Dome
    0.15
    angen
    0.14
    ture
    0.14
     dó
    0.14
     Abb
    0.14
     late
    0.14
    93
    0.14
     H
    0.13
    ekl
    0.13
    Act Density 0.052%

    No Known Activations