INDEX
    Explanations

    references to specific individuals or names

    Negative Logits
    Token              Logit
    "'])){\n"          -0.71
    " OFDb"            -0.61
    "CppCodeGen"       -0.59
    " esternos"        -0.59
    "TagMode"          -0.58
    "']))\n"           -0.57
    " Drittan"         -0.57
    "/*"               -0.56
    "NUMX"             -0.56
    "ftant"            -0.54
    Positive Logits
    Token                 Logit
    "ModelBuilder"        0.75
    " Cowper"             0.64
    "cemic"               0.60
    "GeneratedMessage"    0.59
    " незавершена"        0.58
    " photosynthesis"     0.56
    "scrollbar"           0.55
    " Lâm"                0.54
    " aDecoder"           0.53
    " atenta"             0.52
    Activation Density: 0.000%

    No Known Activations