    NEGATIVE LOGITS
    token          logit
    "sentence"     -0.12
    "rule"         -0.11
    "models"       -0.11
    "schema"       -0.11
    "themes"       -0.11
    "modelo"       -0.11
    "calculate"    -0.11
    "model"        -0.11
    "workflow"     -0.11
    "rules"        -0.11
    POSITIVE LOGITS
    token          logit
    "-many"        0.23
    " many"        0.22
                   0.18
    " viele"       0.17
    "-th"          0.16
    " number"      0.16
    ">"            0.16
                   0.16
    ">="           0.16
    " العديد"      0.16
    Activation Density: 0.094%

    No Known Activations