INDEX
    Explanations

    pronouns, particularly focused on references to subjects

    New Auto-Interp
    Negative Logits
    ization
    -0.46
    ized
    -0.45
    ื่อน
    -0.43
    ist
    -0.43
    OIR
    -0.42
    hip
    -0.41
    kül
    -0.41
     Faz
    -0.40
    koop
    -0.40
    -0.40
    POSITIVE LOGITS
    InjectAttribute
    0.98
    BufferException
    0.96
     ostavi
    0.93
    Šaltiniai
    0.92
    Demografie
    0.91
     NUKAT
    0.90
    TagMode
    0.88
     الرياضيه
    0.88
    Datuak
    0.88
     تانيه
    0.87
    Act Density 0.265%

    No Known Activations