    Negative Logits
        Token           Logit
        ".Getter"       -0.07
        (blank)         -0.07
        " Rudd"         -0.07
        ".XR"           -0.07
        " Ray"          -0.06
        " Display"      -0.06
        " Rud"          -0.06
        " around"       -0.06
        (blank)         -0.06
        " push"         -0.06
    Positive Logits
        Token           Logit
        " College"      0.16
        "College"       0.14
        " college"      0.14
        "college"       0.10
        " Colleges"     0.09
        " colleges"     0.09
        " Colin"        0.09
        " colleg"       0.09
        " collegiate"   0.09
        "cce"           0.08
    Activation Density: 0.010%

    No Known Activations