INDEX
    Explanations

    binary truth values and their corresponding labels

    Negative Logits
    Token           Logit
    batore          -0.65
    évaluateur      -0.63
    rrggbb          -0.61
     shown          -0.58
    bricht          -0.57
    Składniki       -0.57
     plupart        -0.54
     itſelf         -0.54
     tph            -0.54
     SSSR           -0.54

    Positive Logits
    Token           Logit
     ***!           0.78
    MLLoader        0.65
    hoeddwyd        0.59
    issus           0.57
    FXML            0.57
    Vers            0.53
    verwijspagina   0.51
     nahilalakip    0.51
    LookAnd         0.51
     Normdatei      0.49
    Activation Density: 0.010%

    No Known Activations