INDEX
    Explanations

    references to notable individuals and their achievements

    New Auto-Interp
    Negative Logits
    lek
    -0.18
    antage
    -0.16
     themselves
    -0.15
    atorium
    -0.15
    éĻ
    -0.15
     коÑĤоÑĢое
    -0.15
     Auxiliary
    -0.15
    اÙģÙĬØ©
    -0.15
    autiful
    -0.15
    ayout
    -0.15
    POSITIVE LOGITS
     his
    0.19
    "He
    0.18
     Onun
    0.17
    ä»ĸçļĦ
    0.15
     nobody
    0.15
     who
    0.15
     его
    0.15
     suoi
    0.15
     whose
    0.15
     age
    0.15
    Act Density 0.390%

    No Known Activations