    Explanations

    phrases indicating the speaker's viewpoint or explanation

    phrases indicating opinions or assertions from various sources

    Negative Logits
    Token           Logit
    "cffffcc"       -0.76
    "empt"          -0.73
    "plug"          -0.68
    "acts"          -0.63
    " enriched"     -0.63
    " bailed"       -0.63
    " acad"         -0.63
    "riched"        -0.63
    " aven"         -0.63
    "icates"        -0.62

    Positive Logits
    Token           Logit
    " Polly"        0.73
    " Compass"      0.73
    " historian"    0.72
    " Jonathan"     0.71
    " NYT"          0.70
    " Stef"         0.69
    "pace"          0.69
    " Mattis"       0.68
    " Joel"         0.68
    " Laura"        0.68

    Act Density 0.049%

    No Known Activations