    Negative Logits
    " palk"      -0.65
    "thodon"     -0.62
    " forth"     -0.59
    " redor"     -0.58
    " Poche"     -0.58
    "etan"       -0.57
    " mourut"    -0.55
    "Extra"      -0.55
    "Ber"        -0.55
    " Dob"       -0.55
    Positive Logits
    " viruses"        1.59
    " virus"          1.58
    " Virus"          1.54
    " Viruses"        1.49
    "Virus"           1.49
    "virus"           1.41
    "viruses"         1.22
    "onavirus"        1.18
    " VIR"            1.11
    " coronavirus"    1.09
    Activation Density: 0.212%

    No Known Activations