INDEX
    Explanations

    punctuation marks and timing references

    Negative Logits
    بری
    -0.08
    Loop
    -0.06
    言った
    -0.06
    ugins
    -0.06
    /pop
    -0.06
     subsequ
    -0.06
    thouse
    -0.06
    marshall
    -0.06
    署
    -0.06
    کری
    -0.06
    Positive Logits
     Soros
    0.06
     western
    0.06
    カテゴリ
    0.06
    -West
    0.05
     legacy
    0.05
    Portland
    0.05
     Pul
    0.05
     native
    0.05
    ga
    0.05
    clair
    0.05
    Act Density 0.001%

    No Known Activations