    Explanations

    phrases indicating varying degrees of responsibility and professionalism

    Negative Logits
    OMPI
    -0.16
    uhn
    -0.15
    /preferences
    -0.14
    gota
    -0.14
    605
    -0.14
    anske
    -0.14
    ispecies
    -0.14
    BootApplication
    -0.13
    jd
    -0.13
    قرار
    -0.13
    Positive Logits
     manner
    1.11
     fashion
    0.94
     way
    0.91
     Fashion
    0.66
     ways
    0.65
    -fashion
    0.62
     manière
    0.62
    方式
    0.62
     WAY
    0.60
     manera
    0.59
    Act Density 0.159%

    No Known Activations