INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    تقاوى
    -0.93
    abestanden
    -0.84
     iſt
    -0.81
     BoxDecoration
    -0.66
    HtmlAttribute
    -0.66
     ―――――
    -0.65
    amię
    -0.65
     itſelf
    -0.64
     EnglishChoose
    -0.63
     Asegúrese
    -0.63
    POSITIVE LOGITS
    __["
    0.62
     Er
    0.58
    hova
    0.55
     Sch
    0.55
     George
    0.54
    ')}}">
    0.53
     Har
    0.53
     Mau
    0.53
    دارد
    0.53
    Er
    0.53
    Act Density 0.363%

    No Known Activations