INDEX
    Explanations

    modal verbs and expressions of necessity or obligation

    New Auto-Interp
    Negative Logits
    öh
    -0.16
    iry
    -0.16
    xba
    -0.15
     Guaranteed
    -0.15
     Panic
    -0.15
    itary
    -0.14
     Overnight
    -0.14
    jom
    -0.14
    Ùĩر
    -0.14
    ember
    -0.14
    POSITIVE LOGITS
     admit
    0.24
     Wonder
    0.20
     wonder
    0.19
     confess
    0.18
    adr
    0.17
     confession
    0.17
     admitting
    0.17
    polator
    0.16
     warn
    0.16
     æīĭ
    0.16
    Act Density 0.036%

    No Known Activations