INDEX
    Explanations

    names of authors and their associated contributions

    New Auto-Interp
    Negative Logits
     ÙħسÙĦÙħاÙĨ
    -0.16
    ÑĢоÑĩ
    -0.16
    ÃĹ↵↵
    -0.16
    amat
    -0.15
    brero
    -0.15
     Cotton
    -0.15
     ÙħØŃÙħÙĪØ¯
    -0.14
    ller
    -0.14
    ساÙĦ
    -0.14
    ulet
    -0.14
    POSITIVE LOGITS
     Saudi
    0.38
     Riyadh
    0.36
    Saudi
    0.31
     Saudis
    0.30
     Saud
    0.28
     Kingdom
    0.28
     Prince
    0.26
     Arabia
    0.25
     kingdom
    0.24
     King
    0.23
    Act Density 0.042%

    No Known Activations