INDEX
    Explanations
    Negative Logits
    " imput"      -0.09
    ".reducer"    -0.08
    "入力"        -0.08
    " چو"         -0.08
    ".capture"    -0.07
    " снима"      -0.07
    " entrer"     -0.07
    ".fold"       -0.07
    " דו"         -0.07
    " وارد"       -0.07
    Positive Logits
    " served"     0.44
    " serve"      0.41
    " servir"     0.41
    " Served"     0.40
    " serving"    0.40
    " Serve"      0.39
    "Serving"     0.39
    " serves"     0.39
    " Serving"    0.39
    " servido"    0.38
    Act Density 0.034%

    No Known Activations