INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     Bac
    -0.08
     intangible
    -0.08
     sober
    -0.08
     ọrụ
    -0.08
     Anglo
    -0.08
    forth
    -0.07
     Sib
    -0.07
     judged
    -0.07
     strap
    -0.07
     Impression
    -0.07
    POSITIVE LOGITS
     occurrences
    0.08
     loy
    0.08
    Occurrences
    0.08
    occ
    0.08
    /count
    0.07
     cam
    0.07
    root
    0.07
     Loy
    0.07
    -occ
    0.07
    次数
    0.07
    Act Density 0.005%

    No Known Activations