INDEX
    Explanations

    programming variable and type names

    New Auto-Interp
    Negative Logits
     malgré
    -0.97
     industrielle
    -0.95
     має
    -0.94
     السياس
    -0.93
     ſhould
    -0.87
    -0.87
     quinze
    -0.85
    趿
    -0.84
    尽快
    -0.84
     يبدو
    -0.84
    POSITIVE LOGITS
     that
    1.40
     new
    1.16
     first
    1.10
     current
    1.09
     like
    1.07
     where
    1.03
     their
    1.00
     this
    0.99
     "..\..\
    0.94
     what
    0.93
    Act Density 0.017%

    No Known Activations