INDEX
    Explanations

    mentions of Israel and related terms or entities

    New Auto-Interp
    Negative Logits
    enna
    -0.15
    /Private
    -0.14
    /install
    -0.14
    teenth
    -0.14
    agar
    -0.14
    zzle
    -0.14
    dre
    -0.14
    ãĥijãĥ³
    -0.14
    ulk
    -0.14
    umbo
    -0.14
    POSITIVE LOGITS
    led
    0.21
    ognito
    0.17
    gnu
    0.16
     Defense
    0.16
    rael
    0.15
    گاÙĨ
    0.14
    gın
    0.14
    son
    0.14
    nat
    0.14
     Nathan
    0.14
    Act Density 0.015%

    No Known Activations