    Explanations

    phrases that describe the reasoning or justification behind decisions or actions

    Negative Logits
    providedIn          -0.60
     Administrativna    -0.54
    jspb                -0.50
    multer              -0.48
    moveToFirst         -0.47
    ]='\                -0.46
    ImageContext        -0.46
     brid               -0.46
    AddTagHelper        -0.45
    Fucking             -0.45
    Positive Logits
     rationale          1.91
     Rationale          1.84
    Rationale           1.71
     justification      0.98
     Begründung         0.93
     Justification      0.88
     razões             0.80
     reasons            0.78
     alasan             0.77
     raison             0.77
    Activation Density: 0.005%

    No Known Activations