INDEX
    Explanations

    references to origin or source locations in sentences

    New Auto-Interp
    Negative Logits
     itſelf
    -0.65
    ſelf
    -0.58
     himſelf
    -0.56
     myſelf
    -0.56
     ſever
    -0.54
     zoude
    -0.54
     Anſ
    -0.52
     Jefus
    -0.51
     Eſ
    -0.50
     ſta
    -0.50
    POSITIVE LOGITS
    来自
    0.63
    來自
    0.61
     from
    0.57
    __(/*!
    0.50
    getFrom
    0.49
     originating
    0.48
     earlier
    0.48
     irrelevant
    0.47
    จาก
    0.47
     FROM
    0.47
    Act Density 0.423%

    No Known Activations