INDEX
    Explanations
    Negative Logits
    ео            -0.39
     bezig        -0.38
                  -0.38
    banyak        -0.36
     tra          -0.35
     Conway       -0.35
                  -0.35
    matricula     -0.35
    Her           -0.34
     practiced    -0.34
    Positive Logits
    الدراسه            0.69
    AddTagHelper       0.66
     trillion          0.61
    ConstraintMaker    0.60
     dollar            0.60
     billions          0.59
     Chwiliwch         0.59
     للمعارف           0.59
     $\$               0.58
     billion           0.55
    Activation Density: 0.011%

    No Known Activations