INDEX
    Explanations

    contexts involving health-related incidents and injuries

    Negative Logits
    EGA             -0.18
    VisualStyle     -0.18
    dete            -0.17
    eric            -0.16
    bard            -0.16
    çĽijåIJ¬é¡µéĿ¢    -0.16
    éĺħ读次æķ°      -0.16
    lique           -0.16
    $LANG           -0.15
    styleType       -0.15
    Positive Logits
     rot            0.20
     uniform        0.18
     Rot            0.17
     gr             0.17
                    0.16
     al             0.16
     uniformly      0.15
    ifen            0.15
     for            0.15
     mean           0.15
    Act Density 0.047%

    No Known Activations