    Negative Logits
    [no token text]  -0.07
     peppers         -0.06
     oscill          -0.06
    (row             -0.06
     regime          -0.06
     Hole            -0.06
     chi             -0.06
    [no token text]  -0.06
     tisíc           -0.06
     Hobby           -0.06
    Positive Logits
     outlook         0.07
    ("↵              0.07
    _ONCE            0.07
     ActionType      0.06
     histoire        0.06
    trusted          0.06
    (reverse         0.06
    //{↵             0.06
     =========================================================================  0.06
     Pai             0.06
    Activation Density: 0.001%

    No Known Activations