    Explanations

    terms related to health risks and medical conditions

    Negative Logits
    " myſelf"      -0.73
    "seamnă"       -0.73
    " ―――――"       -0.73
    " Jefus"       -0.72
    " Majefty"     -0.71
    " Monfieur"    -0.71
    " Diſ"         -0.70
    " iſt"         -0.69
    " raiſ"        -0.68
    " himſelf"     -0.67
    Positive Logits
    " misalnya"          1.81
    " например"          1.75
    " bijvoorbeeld"      1.62
    " például"           1.60
    " beispielsweise"    1.53
    " Например"          1.35
    " například"         1.28
    "Например"           1.24
    " مث"                1.22
    " example"           1.21
    Activation Density: 0.418%

    No Known Activations