    Explanations

    references to academic authors and their affiliations or contributions in research papers

    Negative Logits
     itſelf      -0.57
     Cæsar       -0.56
    ſelf         -0.54
     fubject     -0.53
    Etimología   -0.52
     Reſ         -0.52
    性和         -0.52
     myſelf      -0.51
     ſche        -0.50
     pleaſure    -0.50
    Positive Logits
    autoreleasepool   0.42
     nyelven          0.40
    ’.                0.39
    .                 0.39
    '.                0.38
    Datuak            0.38
     infine           0.37
     још              0.36
    itattu            0.36
     llegaron         0.36
    Activation Density 0.239%

    No Known Activations