INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    0.40
     políticas
    0.40
     campañas
    0.39
    ProxyAgent
    0.38
     fundada
    0.38
     ناحيه
    0.38
     differentiable
    0.37
     प्रशास
    0.37
    เซีย
    0.37
    职务
    0.37
    POSITIVE LOGITS
     ingredients
    1.35
     Ingredients
    1.16
    ingredients
    1.12
    Ingredients
    1.09
     materials
    1.05
     ingredientes
    1.04
     ingrédients
    1.02
     ингреди
    1.01
    食材
    0.98
     ingred
    0.96
    Act Density 0.255%

    No Known Activations