INDEX
    Explanations
    NEGATIVE LOGITS
    ی  0.51
    0.51
     looming  0.50
    itize  0.47
     sloping  0.47
    َّ  0.47
    ের  0.47
    ^{\  0.46
     functioning  0.46
     rejoicing  0.46
    POSITIVE LOGITS
     oneself  0.49
    ahme  0.47
    ness  0.47
     большого  0.47
    -  0.45
    ton  0.44
    ly  0.44
    തിനുള്ള  0.44
    ması  0.43
    urnal  0.41
    Act Density 0.045%

    No Known Activations