INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     Feng
    -0.07
    .lists
    -0.06
     Row
    -0.06
    .writ
    -0.06
     diffé
    -0.06
     dou
    -0.06
    -0.06
     phường
    -0.06
     preorder
    -0.06
    erokee
    -0.06
    POSITIVE LOGITS
     capital
    0.15
     Capital
    0.14
    Capital
    0.12
    capital
    0.10
     капит
    0.10
     капіт
    0.10
     CAPITAL
    0.09
     capitalists
    0.09
     capitalist
    0.08
     capitalism
    0.08
    Act Density 0.005%

    No Known Activations