INDEX
    Explanations

    the word "pass" in various forms and contexts

    New Auto-Interp
    Negative Logits
     Drummond
    -0.94
     Wul
    -0.75
     Wolff
    -0.73
    ılıyor
    -0.72
     Wimbledon
    -0.71
     Tinto
    -0.69
     Bolton
    -0.69
     resourceCulture
    -0.68
    cenario
    -0.68
     Mukherjee
    -0.68
    POSITIVE LOGITS
     Pass
    2.64
     pass
    2.57
    Pass
    2.48
    pass
    2.46
     PASS
    2.40
     passes
    2.35
     Passes
    2.28
    PASS
    2.26
     passing
    2.22
    passes
    2.14
    Act Density 0.089%

    No Known Activations