INDEX
    Explanations

    keywords denoting a difference or unique characteristic

    New Auto-Interp
    Negative Logits
     enough
    -0.71
     control
    -0.65
     recovery
    -0.63
     savings
    -0.63
     management
    -0.63
     close
    -0.62
     values
    -0.60
     cap
    -0.60
     access
    -0.59
     arm
    -0.59
    POSITIVE LOGITS
    also
    3.13
    sometimes
    1.82
    often
    1.67
    formerly
    1.53
    actually
    1.48
    along
    1.46
    both
    1.43
    usually
    1.42
    again
    1.37
    literally
    1.33
    Act Density 0.016%

    No Known Activations