INDEX
    Explanations

    Mixed content/queries

    New Auto-Interp
    Negative Logits
     Vas
    -0.07
     уверен
    -0.07
    _PLAYER
    -0.06
    民心
    -0.06
     disparities
    -0.06
     List
    -0.06
    Own
    -0.06
    עיתונ
    -0.06
    /fontawesome
    -0.06
    -0.06
    POSITIVE LOGITS
    chl
    0.08
     sider
    0.07
     confronted
    0.07
    𝐶
    0.07
     controllers
    0.07
    𝐿
    0.07
     Carlos
    0.07
    installed
    0.07
     church
    0.07
     quality
    0.07
    Act Density 0.216%

    No Known Activations