INDEX
    Explanations

    specific names and proper nouns, especially related to individuals and entities in the context of competitive sports

    New Auto-Interp
    Negative Logits
     enfans
    -0.65
    انتهای
    -0.49
     resourceCulture
    -0.47
     ویکی‌پدی
    -0.46
    zmán
    -0.46
    ventud
    -0.44
    MessageTagHelper
    -0.44
     élector
    -0.44
    élica
    -0.44
    findpost
    -0.43
    POSITIVE LOGITS
     lên
    0.89
    ออก
    0.70
     xuống
    0.66
    ไว้
    0.58
    ลง
    0.53
     vào
    0.53
    ขึ้น
    0.52
     into
    0.52
     ra
    0.52
     Cowell
    0.47
    Act Density 0.001%

    No Known Activations