INDEX
    Explanations

    information concerning health and medical topics

    Negative Logits
    " then"     -0.28
    " Then"     -0.26
    "then"      -0.25
    " THEN"     -0.24
    "Then"      -0.21
    "THEN"      -0.21
    " então"    -0.19
    " poi"      -0.18
    " puis"     -0.18
    " dann"     -0.17
    Positive Logits
    "ÑĢаÐ"      0.20
    "аÐ"        0.19
    "оÐ"        0.19
    "urrenc"    0.17
    " wom"      0.17
    "leyin"     0.17
    " Europ"    0.17
    "eyJ"       0.17
    "IIIK"      0.16
    "gend"      0.16
    Activation Density: 0.339%

    No Known Activations