INDEX
    Explanations

    instances where a speculative or hypothetical outcome is being discussed

    conditional phrasing and hypotheticals

    New Auto-Interp
    Negative Logits
    wrote
    -0.66
    fighting
    -0.63
    action
    -0.62
     Reporting
    -0.59
    Lind
    -0.59
     Is
    -0.58
     Salman
    -0.58
     touring
    -0.57
     Ren
    -0.57
     Sale
    -0.57
    POSITIVE LOGITS
     suffice
    1.08
    ŃĶ
    0.92
     overshadow
    0.92
    ©¶æ
    0.90
     be
    0.89
     imply
    0.84
     conflic
    0.83
     distort
    0.83
     complicate
    0.83
     alleviate
    0.77
    Act Density 0.367%

    No Known Activations