INDEX
    Explanations

    references to specific infectious diseases and their impacts

    New Auto-Interp
    Negative Logits
    ="__
    -0.16
    _mb
    -0.14
     Rug
    -0.14
     Ph
    -0.14
    ali
    -0.14
     Four
    -0.14
    weit
    -0.13
    ego
    -0.13
    ÑĬ
    -0.13
    away
    -0.13
    POSITIVE LOGITS
    HeaderValue
    0.16
    edral
    0.15
    edl
    0.15
    lias
    0.15
    jang
    0.15
     zev
    0.15
    ừa
    0.15
    лаÑĩ
    0.15
    .Clock
    0.15
     Cran
    0.15
    Act Density 0.010%

    No Known Activations