INDEX
    Explanations

    phrases related to hypothetical situations or conditions

    phrases expressing hypothetical scenarios and their consequences

    New Auto-Interp
    Negative Logits
     disclaimer
    -0.70
    DOC
    -0.64
     outline
    -0.63
     UL
    -0.62
    cius
    -0.59
     intangible
    -0.59
     scept
    -0.58
     DOC
    -0.57
     clarify
    -0.56
     reminder
    -0.55
    POSITIVE LOGITS
    schild
    0.93
     bothered
    0.83
     mattered
    0.82
     anymore
    0.78
    oppers
    0.74
     nor
    0.73
     necessarily
    0.71
    ajor
    0.69
    icable
    0.68
     tolerated
    0.68
    Act Density 0.122%

    No Known Activations