INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     bureaucr
    -0.08
    -0.08
    implementation
    -0.08
     spe
    -0.07
    (vector
    -0.07
    (Vector
    -0.07
     broadcasters
    -0.07
     bureaucracy
    -0.07
     Genau
    -0.07
    arith
    -0.07
    POSITIVE LOGITS
     meals
    0.14
     भोजन
    0.13
     dinner
    0.13
     refeições
    0.13
     খাব
    0.12
     ഭക്ഷ
    0.12
     comidas
    0.12
     Meals
    0.11
     dinners
    0.11
     snacks
    0.11
    Act Density 0.013%

    No Known Activations