INDEX
    Explanations
    Negative Logits
    Token                 Logit
    " Face"               -0.69
    " Roskov"             -0.63
    "elemField"           -0.58
    "Face"                -0.57
    " GenerationType"     -0.57
    " Tiberius"           -0.56
    "GeneratedMessage"    -0.56
    "afficheront"         -0.56
    "Geplaatst"           -0.55
    " photolibrary"       -0.55
    Positive Logits
    Token                 Logit
    " the"                 0.55
    " And"                 0.53
    "bier"                 0.51
    " reality"             0.49
    (blank)                0.49
    "ToTable"              0.48
    "BLES"                 0.48
    "AMILTON"              0.48
    "bre"                  0.47
    " some"                0.47
    Activation Density: 0.239%

    No Known Activations