INDEX
    Explanations

    references to Iranian political figures and institutions

    New Auto-Interp
    Negative Logits
    hq
    -0.17
    .scalablytyped
    -0.15
    ây
    -0.15
     Ø·ÙĦ
    -0.15
    inke
    -0.14
    llib
    -0.14
    Tuy
    -0.14
    hir
    -0.14
     Fork
    -0.14
    wend
    -0.14
    POSITIVE LOGITS
     Binder
    0.15
    į
    0.15
     Ber
    0.14
    vac
    0.14
     grade
    0.14
    ارÙĩ
    0.14
    MainFrame
    0.14
     Grade
    0.14
    inas
    0.13
    otle
    0.13
    Act Density 0.015%

    No Known Activations