INDEX
    Explanations

    names of places and entities related to health or governance

    Negative Logits
    Token       Logit
    ","         -0.91
    " of"       -0.78
    " in"       -0.78
    " on"       -0.73
    " the"      -0.70
    "."         -0.70
    " is"       -0.69
    " for"      -0.69
    " with"     -0.68
    " that"     -0.68
    Positive Logits
    Token               Logit
    "PhysRev"           0.84
    " Paglinawan"       0.84
    " كومونز"           0.81
    " queſta"           0.80
    "ſelves"            0.79
    " RSITY"            0.79
    "FunctionFlags"     0.78
    " Meksiku"          0.78
    " Савезне"          0.77
    " choreographer"    0.76
    Activation Density: 0.786%

    No Known Activations