INDEX
    Explanations
    Negative Logits
    eket                    -0.07
    اعات                    -0.06
     süt                    -0.06
    -java                   -0.06
    zure                    -0.06
    (token text missing)    -0.06
    青年                    -0.06
    (token text missing)    -0.06
    (token text missing)    -0.06
    (token text missing)    -0.06
    Positive Logits
    READ                    0.08
     happier                0.07
    .lower                  0.07
    _subs                   0.07
     auditory               0.06
    \t↵↵                    0.06
     filho                  0.06
    linger                  0.06
    .PER                    0.06
    -spe                    0.06
    Act Density 0.029%

    No Known Activations