INDEX
    Explanations

    references to potential threats or dangerous situations

    Negative Logits
    " effe"       -1.26
    " increa"     -1.25
    " emphat"     -1.24
    " reluct"     -1.23
    " maneu"      -1.21
    " ?..."       -1.20
    " unden"      -1.20
    " snoopy"     -1.18
    " impra"      -1.18
    " strick"     -1.17
    Positive Logits
    "enderror"     0.68
    " someday"     0.68
    " anytime"     0.64
    " tomorrow"    0.63
    "κτηρισ"       0.62
    "oward"        0.59
    " either"      0.59
    "ribune"       0.58
    " anywhere"    0.58
    "calipsis"     0.57
    Activation Density: 0.703%

    No Known Activations