INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     Imaging
    -0.08
     imaging
    -0.07
     К
    -0.07
    mark
    -0.07
    К
    -0.07
    ибка
    -0.07
     $
    -0.07
    _FORCE
    -0.07
    marks
    -0.07
    ):
    -0.07
    POSITIVE LOGITS
     Strict
    0.08
    Strict
    0.08
     strictly
    0.08
    ifique
    0.08
    Minn
    0.07
    0.07
     Execute
    0.07
     assigning
    0.07
     countdown
    0.07
     vini
    0.07
    Act Density 0.006%

    No Known Activations