INDEX
    Explanations

    terms related to terrorism and national security

    New Auto-Interp
    Negative Logits
    asma
    -0.16
    abant
    -0.16
     Film
    -0.15
    lich
    -0.15
     detective
    -0.14
    osy
    -0.14
     Hlav
    -0.14
    inki
    -0.14
    ietf
    -0.14
    èĨľ
    -0.14
    POSITIVE LOGITS
    .Dom
    0.17
    žel
    0.15
     ex
    0.14
    .intellij
    0.14
    icz
    0.14
    meli
    0.14
    *pow
    0.14
     yet
    0.14
    semblies
    0.13
    olan
    0.13
    Act Density 0.069%

    No Known Activations