INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     DAY
    -0.07
     tag
    -0.07
     Day
    -0.07
     flooded
    -0.07
    _basic
    -0.07
     listed
    -0.07
     holiday
    -0.07
     Winter
    -0.07
     rund
    -0.06
    22
    -0.06
    POSITIVE LOGITS
     perception
    0.12
     perceive
    0.11
     Perception
    0.11
     perceptions
    0.11
     perceived
    0.09
     percept
    0.08
    .assertIn
    0.08
     perce
    0.08
     내가
    0.07
    _perc
    0.07
    Act Density 0.010%

    No Known Activations