INDEX
    Explanations

    average, total, percent

    NEGATIVE LOGITS
    Token                        Value
    "学习"                       0.50
    "فهام"                       0.50
    (no token text in source)    0.50
    "帮忙"                       0.48
    (no token text in source)    0.47
    " pembelajaran"              0.47
    " 학습"                      0.47
    " apprendre"                 0.47
    " pédagog"                   0.46
    "客观"                       0.46
    POSITIVE LOGITS
    Token                        Value
    "average"                    0.69
    " averages"                  0.66
    "total"                      0.66
    " total"                     0.64
    " average"                   0.64
    " overall"                   0.64
    "%"                          0.63
    "overall"                    0.61
    "percent"                    0.58
    " percentages"               0.57
    Act Density 0.214%

    No Known Activations