INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    =num
    -0.07
     Nel
    -0.07
    -0.07
     McDon
    -0.07
     wię
    -0.07
     Mand
    -0.07
     adoles
    -0.07
    _POS
    -0.07
     вариан
    -0.07
     verschill
    -0.07
    POSITIVE LOGITS
     Earth
    0.14
     earth
    0.14
    Earth
    0.10
    -earth
    0.08
    earth
    0.07
    arks
    0.07
     AIR
    0.07
     эт
    0.07
    ark
    0.07
     Terra
    0.06
    Act Density 0.012%

    No Known Activations