INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     uses
    -1.15
     Uses
    -1.12
     mild
    -1.03
    Uses
    -1.02
     Mild
    -0.92
     USES
    -0.89
    mild
    -0.80
    HasAnnotation
    -0.79
     AssemblyTitle
    -0.77
     utilizes
    -0.76
    POSITIVE LOGITS
    est
    0.66
    es
    0.58
    s
    0.54
    in
    0.51
    nings
    0.48
     majority
    0.47
    zką
    0.47
    ude
    0.47
     büy
    0.46
    y
    0.46
    Act Density 0.285%

    No Known Activations