INDEX
    Explanations

    phrases related to potential negative events or disasters

    contexts related to events or emergencies

    Negative Logits
    Token             Logit
    "kefeller"        -0.90
    "educated"        -0.70
    "pees"            -0.70
    "sort"            -0.66
    " srfAttach"      -0.66
    "ptives"          -0.65
    "ometry"          -0.65
    "ilib"            -0.64
    "Pub"             -0.64
    "Dub"             -0.64
    Positive Logits
    Token             Logit
    " malf"           1.14
    " emergencies"    1.05
    " misfortune"     1.04
    " malfunction"    1.02
    " unforeseen"     1.01
    " unexpectedly"   1.00
    " stray"          0.99
    " disagreement"   0.97
    " misconduct"     0.96
    " mish"           0.93
    Activation Density: 0.785%

    No Known Activations