    Explanations

    occurrences of the word "run" and its variations

    Negative Logits
    Token            Logit
    "<bos>"          -0.61
    "DotNetBar"      -0.48
    "tste"           -0.47
    (not shown)      -0.46
    "Cec"            -0.46
    "cetti"          -0.44
    "Massimo"        -0.44
    "••••"           -0.43
    "efte"           -0.43
    " Cec"           -0.43
    Positive Logits
    Token      Logit
    " run"     1.93
    "run"      1.80
    " Run"     1.76
    "Run"      1.67
    " RUN"     1.58
    "RUN"      1.52
    " runs"    1.46
    " Runs"    1.46
    "runs"     1.32
    "Runs"     1.32
    Act Density 0.029%

    No Known Activations