    NEGATIVE LOGITS
    EAR         -0.06
     Desktop    -0.06
                -0.06
     gio        -0.06
    uti         -0.06
     PIE        -0.06
    기는        -0.06
    Detect      -0.06
                -0.06
    Make        -0.06
    POSITIVE LOGITS
     Baghdad     0.21
     Iraqi       0.14
     Angola      0.09
     Baghd       0.08
     Mosul       0.08
     Imam        0.08
     Saddam      0.08
     결정        0.07
     FML         0.07
     Ibn         0.07
    Act Density 0.005%

    No Known Activations