INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    className
    -0.07
    .general
    -0.07
    biology
    -0.06
    594
    -0.06
     Austria
    -0.06
    -0.06
     sociology
    -0.06
    既然
    -0.06
     дов
    -0.06
     nursing
    -0.06
    POSITIVE LOGITS
     έν
    0.08
     uměl
    0.07
     espec
    0.07
     rehabilit
    0.07
    enting
    0.06
    venes
    0.06
    interpreted
    0.06
     queryset
    0.06
    	HAL
    0.06
    ।↵
    0.06
    Act Density 0.150%

    No Known Activations