INDEX
    Explanations
    Negative Logits
    Token           Logit
    " Immutable"    -0.08
    " nullable"     -0.08
    " zum"          -0.08
    "_consts"       -0.08
    "buff"          -0.08
    "ime"           -0.08
    "íg"            -0.07
    "Nullable"      -0.07
    "ignite"        -0.07
    "partment"      -0.07
    Positive Logits
    Token           Logit
    "(loop"         0.09
    " loop"         0.08
    " cello"        0.08
    " LOOP"         0.08
    " praised"      0.08
    " Loop"         0.08
    (blank)         0.07
    "Natur"         0.07
    " denunci"      0.07
    " Elevator"     0.07
    Activation Density: 0.000%

    No Known Activations