INDEX
    Explanations

    education, administration, or official proceedings

    New Auto-Interp
    Negative Logits
     ưu
    0.42
     fp
    0.38
    เชื่อ
    0.38
    拥有
    0.37
     Orient
    0.37
     Marvel
    0.36
     bật
    0.36
    皮膚
    0.36
     কারণেই
    0.35
    0.35
    POSITIVE LOGITS
     Kham
    0.46
     prophylactic
    0.42
    viol
    0.41
    лыми
    0.40
     manda
    0.40
    Khal
    0.39
    branches
    0.39
    amy
    0.39
    steering
    0.38
     tabpos
    0.38
    Act Density 0.001%

    No Known Activations