INDEX
    Explanations

    non-English words

    New Auto-Interp
    Negative Logits
    -0.06
    ила
    -0.06
    imar
    -0.06
    loquent
    -0.06
     noc
    -0.06
    ΙΛ
    -0.06
     bilim
    -0.06
     yr
    -0.06
    [h
    -0.06
     rhyme
    -0.06
    POSITIVE LOGITS
    0.07
    0.07
     MCP
    0.07
    0.06
    _RESERVED
    0.06
    _Un
    0.06
    profit
    0.06
     краї
    0.06
    \\
    0.06
     seks
    0.06
    Act Density 0.020%

    No Known Activations