    Negative Logits
    Token          Logit
    "Paint"        -1.69
    " Paint"       -1.69
    " paint"       -1.63
    "paint"        -1.61
    " PAINT"       -1.45
    "PAINT"        -1.34
    " Paints"      -1.26
    " Painting"    -1.20
    " paints"      -1.16
    " painting"    -1.11
    Positive Logits
    Token          Logit
    "ers"           0.51
    "torie"         0.47
    "юза"           0.47
    "t"             0.46
    "subpackage"    0.44
    " оригіналу"    0.44
    "Contour"       0.44
    "tand"          0.44
    "ch"            0.43
    "hlungen"       0.42
    Activation Density: 0.018%

    No Known Activations