INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    .b
    -0.08
     container
    -0.07
    Tab
    -0.07
     nim
    -0.07
     press
    -0.07
     survivors
    -0.07
    [A
    -0.07
     stable
    -0.07
     or
    -0.07
     Minis
    -0.06
    POSITIVE LOGITS
     apl
    0.09
    -third
    0.08
     ভাগ
    0.08
    -thirds
    0.08
    agé
    0.08
    0.08
    gebied
    0.08
    -finals
    0.08
     gete
    0.08
     Ό
    0.08
    Act Density 0.005%

    No Known Activations