    Negative Logits
     хлеб             -0.08
     multicultural    -0.08
     языка            -0.08
     culturele        -0.08
    สดง               -0.08
    ùng               -0.08
     chicken          -0.07
    ความ              -0.07
     Mojo             -0.07
                      -0.07
    Positive Logits
     slated           0.08
    答案              0.08
     Mehrheit         0.08
    bob               0.08
    填写              0.07
     Jared            0.07
                      0.07
     furn             0.07
     recieved         0.07
    Completion        0.07
    Act Density 0.148%

    No Known Activations