INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     Elementor
    -0.10
     Episcopal
    -0.09
     trunks
    -0.09
     læng
    -0.08
    wem
    -0.08
     reto
    -0.08
    hey
    -0.08
     hidrat
    -0.08
     čet
    -0.08
     petición
    -0.08
    POSITIVE LOGITS
     economists
    0.16
     경제
    0.13
     economics
    0.13
     economist
    0.13
     economic
    0.12
     Economics
    0.12
    经济
    0.11
    Theory
    0.11
    理论
    0.11
     welfare
    0.11
    Act Density 0.036%

    No Known Activations