    Explanations

    phrases that express quality and value in various contexts

    NEGATIVE LOGITS
    Token                Logit
    " onSave"            -0.55
    "apunov"             -0.54
    " ospiti"            -0.53
    " suon"              -0.51
    "IMPORTED"           -0.50
    "urlpatterns"        -0.49
    "PasswordEncoder"    -0.49
    "contentLoaded"      -0.49
    "zzlies"             -0.48
    " okuyayım"          -0.47
    POSITIVE LOGITS
    Token                Logit
    " done"              1.19
    "done"               1.06
    " DONE"              1.01
    "Doing"              0.95
    "Done"               0.95
    " Done"              0.94
    "doing"              0.93
    " doing"             0.93
    " DOING"             0.89
    " Doing"             0.86
    Activation Density: 0.182%

    No Known Activations