INDEX
    Explanations

    proper nouns

    New Auto-Interp
    Negative Logits
    s
    -0.67
     Noble
    -0.59
    ات
    -0.54
     revenue
    -0.52
    noble
    -0.50
    fell
    -0.49
    ranges
    -0.47
    Noble
    -0.47
    اتها
    -0.47
     frank
    -0.46
    POSITIVE LOGITS
    rolid
    0.67
     Wikimedijinoj
    0.67
     endwhile
    0.65
     getopt
    0.62
     nawr
    0.61
     oprot
    0.59
    Identyfik
    0.56
    protoimpl
    0.56
    elemField
    0.55
    TintMode
    0.55
    Act Density 0.286%

    No Known Activations