INDEX
    Explanations

    words and phrases indicating expressions, statements, or declarations

    instances of the word "telling."

    Negative Logits
    "urdue"       -0.83
    "cdn"         -0.80
    "uld"         -0.74
    "san"         -0.70
    "engeance"    -0.69
    "nam"         -0.69
    "mun"         -0.68
    "emetery"     -0.68
    "Jump"        -0.67
    "rane"        -0.67
    Positive Logits
    "tale"        1.17
    "ingly"       0.88
    "tell"        0.77
    " tales"      0.75
    "tons"        0.72
    " Tell"       0.70
    " tale"       0.70
    " us"         0.66
    " aloud"      0.66
    " tell"       0.65
    Act Density 0.013%

    No Known Activations