INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    -0.09
     Colonial
    -0.08
    colon
    -0.08
     Colon
    -0.08
     Thur
    -0.08
    Colon
    -0.08
     Keeps
    -0.08
     Lud
    -0.08
    -Core
    -0.08
     besø
    -0.08
    POSITIVE LOGITS
    hew
    0.07
    Functor
    0.07
    408
    0.07
     wi
    0.07
    0.07
    0.07
     batting
    0.07
    ahrenheit
    0.06
     Barker
    0.06
     award
    0.06
    Act Density 0.001%

    No Known Activations