INDEX
    Explanations

    phrases indicating significant impacts or effects, especially those that are negative or chilling in nature

    New Auto-Interp
    Negative Logits
    hooting
    -0.74
    owship
    -0.73
     Lands
    -0.64
    Reviewer
    -0.63
    bis
    -0.62
     aware
    -0.61
     Fighting
    -0.60
    bys
    -0.60
    Train
    -0.60
    estern
    -0.59
    POSITIVE LOGITS
     implications
    1.13
     effect
    1.12
     impact
    1.10
     reperc
    1.07
     ramifications
    1.07
     significance
    1.06
     repercussions
    1.04
     consequences
    0.98
     resonance
    0.96
     origins
    0.96
    Act Density 0.149%

    No Known Activations