INDEX
    NEGATIVE LOGITS
     couverte           -0.72
    WireFormatLite      -0.70
    classnames          -0.68
    nocześnie           -0.67
     Résultats          -0.67
     Gabel              -0.66
    addContainerGap     -0.65
     pylint             -0.65
    erapeu              -0.65
    OGND                -0.64
    POSITIVE LOGITS
     forget             0.70
     Jangan             0.64
     Donny              0.61
                        0.61
    tay                 0.59
     jangan             0.59
    Dont                0.58
    TagHelper           0.58
     Don                0.57
    Don                 0.56
    Activation Density: 0.060%

    No Known Activations