INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    تقاوى
    -0.55
    Portale
    -0.44
     كومونز
    -0.43
     groom
    -0.43
    Erreferentziak
    -0.42
     junit
    -0.40
     beru
    -0.39
    ってみて
    -0.39
     Vino
    -0.39
     apt
    -0.38
    POSITIVE LOGITS
    8
    0.73
    AddHtmlAttribute
    0.69
    7
    0.67
    9
    0.62
    6
    0.60
    étrie
    0.60
    хьтан
    0.59
     Eighth
    0.57
     poveznice
    0.56
    zzlies
    0.54
    Act Density 0.002%

    No Known Activations