INDEX
    Explanations

    phrases indicating recommendations or suggestions

    statements expressing obligations or recommendations

    New Auto-Interp
    Negative Logits
    reality
    -0.70
     Puzzle
    -0.69
    locked
    -0.67
    atile
    -0.67
     Trails
    -0.65
    ITED
    -0.61
    Soul
    -0.61
    ãĥŃ
    -0.59
     Vers
    -0.58
    cule
    -0.58
    POSITIVE LOGITS
     be
    0.94
    ered
    0.92
     ideally
    0.85
     preferably
    0.82
    ering
    0.81
     rightfully
    0.80
     surely
    0.78
     definitely
    0.78
    othal
    0.77
     probably
    0.76
    Act Density 0.059%

    No Known Activations