INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     less
    -1.77
     Less
    -1.53
    Less
    -1.49
     moins
    -1.33
     menos
    -1.31
     LESS
    -1.28
     weniger
    -1.16
     fewer
    -1.16
    less
    -1.13
     менее
    -1.05
    POSITIVE LOGITS
     <<<<<<<<<<<<<<
    0.59
     important
    0.58
    oneofs
    0.51
     meaningful
    0.49
     appreciated
    0.49
    AddTagHelper
    0.49
     substantial
    0.48
    0.47
    DataAnnotations
    0.47
     />\
    0.47
    Act Density 0.136%

    No Known Activations