INDEX
    Explanations

    programming code

    Negative Logits
    e      -1.20
    a      -1.19
    er     -1.10
    o      -1.01
    i      -0.85
    ه      -0.81
    en     -0.74
    y      -0.72
    eer    -0.71
    ی      -0.71
    Positive Logits
     itſelf     1.00
     Efq        0.98
     myſelf     0.95
    ^(@)        0.92
     Monfieur   0.91
     ་་         0.90
    ſelves      0.89
     Jefus      0.88
     ―――――      0.88
    ſelf        0.85
    Act Density 0.423%

    No Known Activations