INDEX
    Negative Logits
    " keuze"       0.50
    "choices"      0.44
    "选择了"        0.44
    " विकल्पों"      0.44
    " choices"     0.44
                   0.44
    " выбрать"     0.43
    "న్నీ"          0.43
    " choice"      0.43
    "choice"       0.42
    Positive Logits
    "収入"             0.42
    "資源"             0.41
    " Compens"        0.40
    " malnutrition"   0.40
    " incrementar"    0.40
    " guer"           0.39
    " soldier"        0.38
    "それでも"          0.38
    " compensate"     0.37
    "頑張"             0.37
    Act Density 0.002%

    No Known Activations