INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     unlawfully
    0.37
     pathogenesis
    0.37
     tumorigen
    0.35
     dystopian
    0.34
     neoliberal
    0.34
     dissidents
    0.33
     autoridades
    0.33
    कांक्षा
    0.33
     autorización
    0.32
     Đảng
    0.32
    POSITIVE LOGITS
     t
    0.41
     cauliflower
    0.40
     w
    0.40
     round
    0.38
     ک
    0.38
     bell
    0.38
     k
    0.37
     B
    0.37
     curly
    0.37
     variety
    0.36
    Act Density 0.098%

    No Known Activations