INDEX
    Explanations

    phrases indicating restraint or inhibition

    phrases indicating restraint or delay

    Negative Logits
    ²¾            -0.84
    swick         -0.75
    ivia          -0.74
    aundering     -0.71
    aters         -0.71
    ioxide        -0.71
    ceans         -0.69
    apest         -0.68
    ģ«            -0.66
    iterranean    -0.66
    Positive Logits
    hold          0.78
     ransom       0.78
     hold         0.77
     tight        0.73
     hostage      0.70
     reins        0.70
     sway         0.65
     ledge        0.64
     grip         0.64
     plun         0.63
    Act Density 0.082%

    No Known Activations