INDEX
    Explanations

    references to specific sports teams and their affiliations

    New Auto-Interp
    Negative Logits
    featureID
    -0.51
    ArgsConstructor
    -0.48
    eps
    -0.47
    Jîn
    -0.45
    ัส
    -0.45
    henswürdigkeiten
    -0.44
     conci
    -0.44
    pilar
    -0.44
    צה
    -0.43
    -0.43
    POSITIVE LOGITS
     ProtoMessage
    0.70
     AssemblyCulture
    0.68
     transfieras
    0.67
    WaitGroup
    0.65
     Microb
    0.59
     jspb
    0.57
     UITableViewCell
    0.56
    Académie
    0.56
    árol
    0.56
    tvguidetime
    0.55
    Act Density 0.058%

    No Known Activations