INDEX
    Explanations

    references to numerical data or citations in texts

    New Auto-Interp
    Negative Logits
    ivan
    -0.16
    ilogue
    -0.15
    fried
    -0.15
    iese
    -0.15
    ift
    -0.14
    onda
    -0.14
    etta
    -0.14
    ialog
    -0.14
    entina
    -0.14
     sticky
    -0.14
    POSITIVE LOGITS
     Wyn
    0.17
    assistant
    0.16
    ryn
    0.15
    //{{
    0.15
    pais
    0.15
     Trojan
    0.15
    appe
    0.14
     PREF
    0.14
    onio
    0.14
    ÑĢÑĥн
    0.14
    Act Density 0.006%

    No Known Activations