INDEX
    Explanations

    assurances and statements conveying certainty or emphasis

    expressions of assurance or promises

    New Auto-Interp
    Negative Logits
    ynski
    -0.76
    adesh
    -0.73
    abouts
    -0.69
    isol
    -0.65
     sided
    -0.62
    ikes
    -0.61
    chery
    -0.60
    earable
    -0.59
     Includes
    -0.59
    artifacts
    -0.58
    POSITIVE LOGITS
     readers
    1.10
     anybody
    1.01
     anyone
    1.01
     thee
    1.01
     ya
    1.00
     listeners
    0.99
     you
    0.94
     viewers
    0.94
     myself
    0.92
     believers
    0.91
    Act Density 0.106%

    No Known Activations