INDEX
    Explanations

    references to specific locations and significant time periods

    New Auto-Interp
    Negative Logits
    ÏĦει
    -0.15
     Tun
    -0.15
    zw
    -0.15
    .sdk
    -0.15
    addir
    -0.14
    .Utility
    -0.14
    anguages
    -0.14
    olo
    -0.14
    icult
    -0.13
    yard
    -0.13
    POSITIVE LOGITS
    USA
    0.15
    /archive
    0.15
    ùi
    0.14
    -Israel
    0.14
    uta
    0.14
    ãĤ·ãĥ£ãĥ«
    0.14
    /INFO
    0.14
    ÅĻad
    0.13
     ÐļÑĢи
    0.13
     Drinking
    0.13
    Act Density 0.369%

    No Known Activations