INDEX
    Explanations

    formal components and sections of academic research papers

    New Auto-Interp
    Negative Logits
     surla
    -0.94
    ftagPool
    -0.80
     linkovi
    -0.78
     Wikimedijinoj
    -0.75
    CodeAttribute
    -0.71
    istoitu
    -0.70
    Boas
    -0.67
    homonymie
    -0.67
     nahilalakip
    -0.65
     estimés
    -0.63
    POSITIVE LOGITS
     BoxFit
    0.53
     szól
    0.52
    gonic
    0.49
    Démographie
    0.47
     tubo
    0.44
     követ
    0.43
     gedaan
    0.43
     retir
    0.43
    hangi
    0.42
     имен
    0.42
    Act Density 0.018%

    No Known Activations