INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    -0.06
    .Word
    -0.06
     Mana
    -0.06
     occupancy
    -0.06
    "While
    -0.06
     determin
    -0.06
     evangelical
    -0.05
    振り
    -0.05
     ritual
    -0.05
     disdain
    -0.05
    POSITIVE LOGITS
     граж
    0.07
     иг
    0.07
     billeder
    0.07
     chết
    0.07
    (baseUrl
    0.07
    ibling
    0.07
    collapsed
    0.07
     DERP
    0.07
    로드
    0.06
     persecuted
    0.06
    Act Density 0.029%

    No Known Activations