INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     Southwest
    -0.08
    _drag
    -0.08
    urricanes
    -0.08
    Prem
    -0.07
    Eh
    -0.07
     eh
    -0.07
    .animate
    -0.07
    Drag
    -0.07
     vt
    -0.07
    .sim
    -0.07
    POSITIVE LOGITS
     diligent
    0.09
    aled
    0.08
    0.08
    criber
    0.08
    0.08
    kle
    0.08
    0.07
    CEF
    0.07
     effectivement
    0.07
     desk
    0.07
    Act Density 0.005%

    No Known Activations