INDEX
    Explanations

    references to limitations or restrictions, particularly in the context of entries or applications

    New Auto-Interp
    Negative Logits
    aho
    -0.16
    ITA
    -0.16
    (Output
    -0.15
    ettle
    -0.15
    ita
    -0.14
    uga
    -0.14
    اØŃÛĮ
    -0.14
    ille
    -0.14
    ague
    -0.14
    istar
    -0.13
    POSITIVE LOGITS
     entry
    1.04
     Entry
    0.90
    entry
    0.87
    -entry
    0.84
    Entry
    0.82
    _entry
    0.80
     entries
    0.79
     ENTRY
    0.78
     enter
    0.75
    .entry
    0.75
    Act Density 0.118%

    No Known Activations