INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     Interpretation
    -0.09
     ساده
    -0.09
     jednoduch
    -0.09
     interpretation
    -0.08
    -0.08
     Approach
    -0.08
     Simpl
    -0.08
     sencilla
    -0.08
     Solution
    -0.08
     eenvoudige
    -0.08
    POSITIVE LOGITS
     options
    0.15
     choices
    0.14
     opciones
    0.14
     multiple
    0.14
    options
    0.13
     opções
    0.13
    Options
    0.13
     exam
    0.13
     الخيارات
    0.13
     quiz
    0.13
    Act Density 0.047%

    No Known Activations