    Negative Logits
    "PDO"        -0.08
    "pheres"     -0.08
    "aviours"    -0.07
    "WIN"        -0.06
    " silicon"   -0.06
    " phi"       -0.06
    "woo"        -0.06
    "father"     -0.06
    "loh"        -0.06
    "zheimer"    -0.06
    Positive Logits
    " recently"     0.12
    " Recently"     0.11
    " recent"       0.10
    " Recent"       0.09
    " unexpected"   0.09
    " lately"       0.08
    "Recently"      0.08
    "recent"        0.08
    " Latest"       0.08
    " frequent"     0.08
    Act Density 0.021%

    No Known Activations