INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     Dante
    -0.07
    atha
    -0.07
     Giul
    -0.07
     reserva
    -0.07
    erta
    -0.07
     commentary
    -0.06
     scale
    -0.06
     fascism
    -0.06
     fasc
    -0.06
    -0.06
    POSITIVE LOGITS
     job
    0.10
    jobs
    0.10
     jobs
    0.09
     Job
    0.09
     JOB
    0.08
    Job
    0.08
    	job
    0.08
    .jobs
    0.07
     Jobs
    0.07
    /token
    0.07
    Act Density 0.022%

    No Known Activations