INDEX
    Explanations

    Start of text/conversations

    New Auto-Interp
    Negative Logits
     تست
    -0.08
    িদ
    -0.07
    ước
    -0.07
     swirling
    -0.07
    िता
    -0.07
     misconceptions
    -0.07
     understandable
    -0.07
    -0.07
     Vors
    -0.07
     zun
    -0.07
    POSITIVE LOGITS
    nd
    0.12
    (二
    0.12
    -thirds
    0.11
    第二
    0.11
     രണ്ടാം
    0.10
     দ্বিত
    0.10
     second
    0.10
    second
    0.10
     zweiten
    0.10
    .secondary
    0.09
    Act Density 0.141%

    No Known Activations