INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    zoom
    -0.07
    езда
    -0.07
     Ideal
    -0.06
     Institutional
    -0.06
    cia
    -0.06
    klä
    -0.06
    _PED
    -0.06
    apol
    -0.06
    .ne
    -0.06
     Рад
    -0.06
    POSITIVE LOGITS
     virus
    0.16
     Virus
    0.12
     viruses
    0.11
    avirus
    0.09
    _tensor
    0.08
    irus
    0.08
     coronavirus
    0.08
    -vous
    0.07
     Chavez
    0.07
     crisis
    0.07
    Act Density 0.007%

    No Known Activations