INDEX
    Explanations

    references to Gaza and the conflict surrounding it

    Negative Logits
    .si       -0.16
     Ngb      -0.14
    trl       -0.14
    ियर       -0.14
    tape      -0.14
    WXYZ      -0.14
     ivory    -0.14
    Eb        -0.13
    лож       -0.13
     mot      -0.13
    Positive Logits
     Electricity   0.14
    ownik          0.14
    yntax          0.14
    ÄŁa            0.14
     cash          0.13
    AuthToken      0.13
    شاÙĩ           0.13
    _tra           0.13
    .vertx         0.13
    antlr          0.13
    Act Density 0.010%

    No Known Activations