INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     bian
    -0.08
     cob
    -0.06
    برد
    -0.06
     vulgar
    -0.06
     Albert
    -0.06
    urs
    -0.06
     fuer
    -0.06
    -0.06
    fork
    -0.06
     rapp
    -0.06
    POSITIVE LOGITS
     time
    0.37
     Time
    0.31
     TIME
    0.26
    Time
    0.25
    time
    0.23
    -time
    0.22
    	time
    0.20
    -Time
    0.18
    _time
    0.18
     times
    0.18
    Act Density 0.172%

    No Known Activations