INDEX
    Explanations

    references to various groups, medications, and agents in a health or medical context

    New Auto-Interp
    Negative Logits
     OMITBAD
    -0.82
     مشين
    -0.69
    ✭✭
    -0.65
    enumii
    -0.61
    Hochspringen
    -0.60
     faſt
    -0.60
     pleaſure
    -0.59
     виправивши
    -0.58
     Efq
    -0.57
    GEBURTSDATUM
    -0.57
    POSITIVE LOGITS
     I
    0.36
     embraced
    0.36
    saga
    0.34
     that
    0.34
    ENOMEM
    0.33
    丁目
    0.33
     Malam
    0.33
     sold
    0.33
    ijnt
    0.32
     druck
    0.32
    Act Density 0.172%

    No Known Activations