INDEX
    Explanations

    phrases related to consequences, especially related to payments or obligations

    New Auto-Interp
    Negative Logits
     Moroc
    -0.69
     Barney
    -0.67
    luster
    -0.66
    bledon
    -0.61
     Fern
    -0.60
    ADRA
    -0.60
     Dres
    -0.58
     Klein
    -0.57
    Gall
    -0.57
     Seym
    -0.56
    POSITIVE LOGITS
     ][
    0.79
     Reincarnated
    0.70
    "?
    0.66
    ihad
    0.65
     )))
    0.63
    nt
    0.62
    \":
    0.62
    oi
    0.61
    soType
    0.60
    rael
    0.59
    Act Density 0.351%

    No Known Activations