INDEX
    Explanations

    infringement

    New Auto-Interp
    Negative Logits
    cont
    -0.07
     morals
    -0.06
    _loss
    -0.06
    .sqrt
    -0.06
    _bits
    -0.06
    /msg
    -0.06
     pob
    -0.06
     optics
    -0.06
    (dl
    -0.06
     getClient
    -0.06
    POSITIVE LOGITS
     infringement
    0.13
     infring
    0.11
     infr
    0.10
     Inf
    0.07
     εγκα
    0.07
    0.07
    INFRINGEMENT
    0.07
     Fram
    0.06
     decre
    0.06
     skirm
    0.06
    Act Density 0.002%

    No Known Activations