INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     confisc
    -0.06
     soruml
    -0.06
    CONTROL
    -0.06
    backgroundColor
    -0.06
    .False
    -0.06
    保持
    -0.06
    []↵
    -0.06
    acement
    -0.06
    aları
    -0.06
     wszyst
    -0.06
    POSITIVE LOGITS
     Heat
    0.07
     stret
    0.07
     QR
    0.07
    gene
    0.07
     inf
    0.06
    cene
    0.06
     DNA
    0.06
     المن
    0.06
    lias
    0.06
     perf
    0.06
    Act Density 0.008%

    No Known Activations