INDEX
    Explanations

    references to government and military structures

    New Auto-Interp
    Negative Logits
    hop
    -0.15
    ilar
    -0.15
    upa
    -0.15
     åı
    -0.15
     then
    -0.14
    loff
    -0.14
    æľĢ
    -0.14
    ê·¼
    -0.14
     themselves
    -0.14
    oses
    -0.13
    POSITIVE LOGITS
    ä¹ĭä¸Ģ
    0.22
    ä¹Łæĺ¯
    0.17
    zeit
    0.16
    plaintext
    0.15
    akat
    0.15
     Schmidt
    0.14
    riv
    0.14
    vfs
    0.14
    fan
    0.14
    amac
    0.14
    Act Density 0.221%

    No Known Activations