INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    +w
    -0.07
     οργ
    -0.07
    	delta
    -0.07
    	user
    -0.06
     Kir
    -0.06
    eder
    -0.06
     Método
    -0.06
    	real
    -0.06
     Yer
    -0.06
     rotated
    -0.06
    POSITIVE LOGITS
     sc
    0.11
     Sc
    0.11
     SC
    0.09
     scout
    0.09
     scam
    0.08
    (sc
    0.07
     Scalia
    0.07
    (student
    0.07
    Sc
    0.07
     scram
    0.07
    Act Density 0.071%

    No Known Activations