INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     materia
    -0.08
    (Page
    -0.07
     broadcasting
    -0.07
    ,nonatomic
    -0.07
    Film
    -0.07
    vatel
    -0.07
    postId
    -0.06
     tym
    -0.06
     Removal
    -0.06
     belli
    -0.06
    POSITIVE LOGITS
     Healthy
    0.11
     healthy
    0.11
    healthy
    0.11
    Healthy
    0.09
    κου
    0.08
    _GU
    0.07
    0.07
     unhealthy
    0.07
     Lean
    0.06
     Î
    0.06
    Act Density 0.013%

    No Known Activations