INDEX
    Explanations

    mentions of sabotage and state actors

    New Auto-Interp
    Negative Logits
     Amb
    -0.15
    intColor
    -0.14
    urum
    -0.14
     rapor
    -0.13
    awy
    -0.13
    FOUNDATION
    -0.13
     geçen
    -0.13
    archs
    -0.13
    uche
    -0.13
    ĥĿ
    -0.13
    POSITIVE LOGITS
     somehow
    0.15
     chron
    0.14
     mini
    0.14
    leta
    0.14
     Toro
    0.14
    urn
    0.14
    ActivityIndicatorView
    0.13
     realized
    0.13
    erval
    0.13
     te
    0.13
    Act Density 0.390%

    No Known Activations