INDEX
    Explanations

    phrases indicating a statement or declaration being made

    expressions of caution or reluctance to provide information

    New Auto-Interp
    Negative Logits
    uses
    -0.74
    ngth
    -0.73
    itton
    -0.73
    ranch
    -0.73
    igor
    -0.67
    endium
    -0.66
    aughs
    -0.66
     Juda
    -0.66
    pes
    -0.66
    fired
    -0.65
    POSITIVE LOGITS
     anymore
    1.19
     anything
    1.08
     specifics
    0.93
     nor
    0.88
     exactly
    0.84
     goodbye
    0.83
     publicly
    0.83
     definitively
    0.81
     anybody
    0.77
     anywhere
    0.74
    Act Density 0.089%

    No Known Activations