INDEX
    Explanations

    references to medical or health-related guidelines

    New Auto-Interp
    Negative Logits
     houſe
    -1.01
     виправивши
    -1.00
     Houſe
    -0.98
     Wikimedijinoj
    -0.97
     يتيمه
    -0.96
     חיצוניים
    -0.94
    #+#
    -0.92
     pleaſure
    -0.92
     itſelf
    -0.92
     Мексичка
    -0.91
    POSITIVE LOGITS
    0.54
    0.34
    0.34
     …
    0.34
    0.33
     es
    0.33
     .
    0.33
     very
    0.32
    hal
    0.32
     k
    0.32
    Act Density 2.884%

    No Known Activations