INDEX
    Explanations

    references to significant news events and incidents related to safety and protection

    Negative Logits
    timewa              -0.58
    Datuak              -0.57
    .*")]               -0.57
     surla              -0.54
    alej                -0.52
    LikeLike            -0.51
     specificity        -0.48
                        -0.47
     wikipagina         -0.47
    cupertino           -0.47

    Positive Logits
     kasarigan          0.64
    Tikang              0.61
    ValueStyle          0.60
    pexpr               0.59
    intios              0.56
    StoryboardSegue     0.52
    PhysRev             0.51
    SharedCtor          0.51
     künftig            0.51
    Spoljašnje          0.51
    Act Density 0.341%

    No Known Activations