INDEX
    Explanations

    references to actions and assertions made by individuals

    New Auto-Interp
    Negative Logits
     оригіналу
    -0.70
    doesn
    -0.57
    Do
    -0.56
    dos
    -0.56
    don
    -0.54
    ıştı
    -0.53
    gráficos
    -0.53
    DON
    -0.53
    jspx
    -0.52
    DO
    -0.50
    POSITIVE LOGITS
     did
    2.64
    did
    1.92
    Did
    1.85
     Did
    1.84
     DID
    1.28
    DID
    1.09
     didst
    0.91
     gjorde
    0.86
     done
    0.85
     hicieron
    0.72
    Act Density 0.373%

    No Known Activations