INDEX
    Explanations

    content related to healthcare and social issues

    New Auto-Interp
    Negative Logits
     
    -0.15
    ãĢĭï¼Ī
    -0.13
    oot
    -0.13
    ange
    -0.13
    uck
    -0.13
     Continued
    -0.13
    IBE
    -0.13
    aus
    -0.13
     ãģĮ
    -0.12
    opy
    -0.12
    POSITIVE LOGITS
     There
    0.19
     Although
    0.19
     This
    0.19
     The
    0.18
     If
    0.17
    reetings
    0.17
     When
    0.17
     Since
    0.16
     ************************************************************************
    0.16
    There
    0.16
    Act Density 0.716%

    No Known Activations