INDEX
    Explanations

    names and locations related to criminal activities or legal cases

    New Auto-Interp
    Negative Logits
    lz
    -0.16
     neutron
    -0.16
    ائÙĦ
    -0.15
    isay
    -0.15
    reck
    -0.15
    ivy
    -0.15
    ervas
    -0.15
    atz
    -0.14
    ourg
    -0.14
     Beng
    -0.14
    POSITIVE LOGITS
     Delta
    0.16
     Bunu
    0.15
    Delta
    0.15
     paramount
    0.15
    mouseup
    0.15
     Ih
    0.14
    erin
    0.14
     internally
    0.14
    chter
    0.14
    _capabilities
    0.14
    Act Density 0.042%

    No Known Activations