INDEX
    Explanations

    references to authorship and contributions within academic manuscripts

    New Auto-Interp
    Negative Logits
     AssemblyProduct
    -0.74
    Autoritní
    -0.70
    writeFieldEnd
    -0.68
     ProtoMessage
    -0.68
     ujednoznacz
    -0.66
    kloped
    -0.64
     ویکی‌پدی
    -0.63
    niająca
    -0.63
    IntoConstraints
    -0.63
     мәкал
    -0.62
    POSITIVE LOGITS
    mazioni
    0.55
     jotka
    0.52
     realized
    0.47
     realization
    0.47
     scented
    0.46
    errHandler
    0.45
     realizes
    0.44
     enumerated
    0.44
     erreich
    0.43
    evos
    0.43
    Act Density 0.010%

    No Known Activations