    Negative Logits
    Token       Logit
    "_CUBE"     -0.07
                -0.07
    "Surv"      -0.07
    " fatty"    -0.07
    " Gulf"     -0.07
    " dán"      -0.06
    " Simpl"    -0.06
    "aliz"      -0.06
    "pio"       -0.06
    ".Integer"  -0.06
    Positive Logits
    Token       Logit
    " Beck"     0.18
    "beck"      0.13
    " Becker"   0.12
    " beck"     0.12
    " Beckham"  0.09
    " Becky"    0.08
    " Buck"     0.07
    "idebar"    0.07
    " Wick"     0.06
    " setback"  0.06
    Activation Density: 0.001%

    No Known Activations