INDEX
    Explanations

    references to the COVID-19 pandemic

    New Auto-Interp
    Negative Logits
    agues
    -0.15
     Types
    -0.15
    orer
    -0.14
    rzy
    -0.14
    iges
    -0.14
     آذ
    -0.14
    itas
    -0.14
    ладÑĥ
    -0.13
     Worst
    -0.13
    oser
    -0.13
    POSITIVE LOGITS
     Mast
    0.14
    ãģ¾ãģł
    0.14
    ARIO
    0.14
    cke
    0.13
    ous
    0.13
     mast
    0.13
    Slug
    0.13
    arios
    0.13
    ously
    0.13
    باز
    0.13
    Act Density 0.009%

    No Known Activations