INDEX
    Explanations

    references to Iranian political figures and leadership

    New Auto-Interp
    Negative Logits
    pson
    -0.15
    enheim
    -0.15
    ogy
    -0.14
    ná
    -0.14
    OGLE
    -0.14
    elihood
    -0.14
    	Copyright
    -0.14
    ui
    -0.14
     guy
    -0.13
    à¹ĥà¸Ī
    -0.13
    POSITIVE LOGITS
    hil
    0.15
    ARIO
    0.15
    ilha
    0.15
    ÑĸÑĢ
    0.14
     recreation
    0.14
     ì¼ĢìĿ´
    0.14
    215
    0.14
    allet
    0.13
    ario
    0.13
    pts
    0.13
    Act Density 0.003%

    No Known Activations