INDEX
    Explanations

    instances of the word "runner" and its variations in different contexts

    New Auto-Interp
    Negative Logits
    Portail
    -0.61
    ymce
    -0.52
    <bos>
    -0.49
    addGap
    -0.48
    ig
    -0.46
    SuppressLint
    -0.45
    indexOf
    -0.43
    IG
    -0.42
     Garg
    -0.41
     Diaz
    -0.40
    POSITIVE LOGITS
     Runner
    1.45
     runner
    1.35
    Runner
    1.30
    runner
    1.22
     Runners
    1.13
     runners
    1.05
    Runners
    1.02
    runners
    0.98
    SpringRunner
    0.85
    TestRunner
    0.85
    Act Density 0.003%

    No Known Activations