INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    _modifier
    -0.08
     حسن
    -0.07
    -0.07
     الهند
    -0.07
    Compression
    -0.07
    -0.07
    iant
    -0.07
     Documentary
    -0.07
     narrow
    -0.07
     Bulgaria
    -0.06
    POSITIVE LOGITS
     wake
    0.16
     Wake
    0.13
     wakes
    0.12
    Wake
    0.11
     waking
    0.11
     woke
    0.11
    wake
    0.10
     взрос
    0.09
    UCE
    0.08
     Awake
    0.08
    Act Density 0.004%

    No Known Activations