INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     Greater
    0.47
     Considerable
    0.45
    Greater
    0.44
     considerable
    0.43
    0.39
    aksikan
    0.39
     idéal
    0.38
    0.38
     '>
    0.37
     متحدہ
    0.37
    POSITIVE LOGITS
     conservative
    0.64
    conservative
    0.61
    Conserv
    0.60
    Conservative
    0.59
    conserv
    0.57
     konserv
    0.57
     conservatives
    0.55
     conserv
    0.54
     Conservative
    0.51
    保守
    0.51
    Act Density 0.000%

    No Known Activations