INDEX
    Explanations

    the word "only" with a focus on emphasizing limitations or exclusivity

    phrases expressing the idea of something being partially complete or insufficient

    New Auto-Interp
    Negative Logits
    ros
    -0.74
    ducers
    -0.66
    idon
    -0.63
    rigan
    -0.63
    wealth
    -0.62
    rosis
    -0.61
    rote
    -0.61
    insula
    -0.60
    atana
    -0.60
    hement
    -0.59
    POSITIVE LOGITS
     marginally
    1.08
     kidding
    0.79
     scratched
    0.78
     temporary
    0.73
     partially
    0.72
     scratching
    0.68
     allowed
    0.67
     accessible
    0.67
     temporarily
    0.66
     indirectly
    0.66
    Act Density 0.064%

    No Known Activations