INDEX
    Explanations

    role and item definition

    New Auto-Interp
    Negative Logits
     তিনজন
    0.44
    Commission
    0.43
     spectrum
    0.42
    Spectrum
    0.42
     Adams
    0.42
     conseguenza
    0.41
     Emin
    0.41
    Cliff
    0.41
     Pul
    0.41
    Continued
    0.40
    POSITIVE LOGITS
    extré
    0.58
    0.55
    ceğ
    0.54
    oras
    0.52
    ísim
    0.52
    umers
    0.52
     airways
    0.51
     extré
    0.50
    ată
    0.49
    ตัวเอง
    0.49
    Act Density 0.000%

    No Known Activations