INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     وتسجيلات
    -0.94
    twimg
    -0.82
    تفصیلات
    -0.74
     '\\;'
    -0.72
    المناصب
    -0.71
    })();
    
    -0.71
    "]
    
    -0.69
     насељу
    -0.69
    "],
    
    -0.69
    Jeografia
    -0.68
    POSITIVE LOGITS
     nature
    1.00
     Mother
    0.75
     mother
    0.66
     prí
    0.64
     natural
    0.63
     pří
    0.63
    Mother
    0.63
     nat
    0.60
    nature
    0.57
    自然
    0.56
    Act Density 0.002%

    No Known Activations