INDEX
    Explanations

    acronyms and abbreviations relevant to various organizations and initiatives

    New Auto-Interp
    Negative Logits
    ÑĤÑĢа
    -0.16
    оÑĩка
    -0.15
    OKIE
    -0.15
     Levin
    -0.14
    issen
    -0.14
    λεκ
    -0.14
     Sesso
    -0.14
    оÑĩкÑĥ
    -0.13
    .threshold
    -0.13
    urret
    -0.13
    POSITIVE LOGITS
    ubre
    0.16
    ellig
    0.16
     wake
    0.14
     Wake
    0.14
    zk
    0.14
     Direct
    0.13
    wa
    0.13
    atak
    0.13
    ing
    0.13
     cri
    0.13
    Act Density 0.051%

    No Known Activations