INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     ?>">
    -0.96
     Paral
    -0.83
    ly
    -0.83
    nders
    -0.80
    Bav
    -0.78
    Embedded
    -0.76
     Ruman
    -0.75
     Refle
    -0.74
     }>
    -0.73
     Ait
    -0.73
    POSITIVE LOGITS
     góry
    0.75
    DateField
    0.72
     Fife
    0.71
     Woodford
    0.71
     Biscuit
    0.70
    wezen
    0.70
     OPERATOR
    0.69
     Dodson
    0.69
    OPERATOR
    0.69
     Stans
    0.68
    Act Density 0.480%

    No Known Activations