INDEX
    Explanations

    calculate averages and statistics

    Negative Logits
    Token             Logit
    "ennzeichnet"     0.39
                      0.39
    " वायू"            0.39
    "Allocation"      0.39
    "तंत्र"             0.38
    " учре"           0.38
    "တော်"             0.38
    " किताबों"          0.38
    "自行车"           0.37
    " علیحد"          0.37
    Positive Logits
    Token             Logit
    " average"        1.60
    " Average"        1.55
    " averages"       1.53
    "Average"         1.52
    "平均"            1.48
    " 평균"           1.43
    "average"         1.38
    " 平均"           1.36
    " AVERAGE"        1.35
    " avg"            1.34
    Act Density 0.042%
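
    As a rough illustration of what a density figure like this can mean, the sketch below assumes that activation density is the fraction of tokens on which the feature's activation exceeds a threshold. That definition, the function name `activation_density`, and the sample data are all assumptions made for illustration; the page itself does not state how the number was computed.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of tokens whose activation exceeds `threshold`.

    Assumption: "density" means the share of tokens with a nonzero
    (above-threshold) activation. The source page does not define it.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: 10,000 tokens, 4 of which activate the feature.
sample = [0.0] * 9996 + [0.7, 1.2, 0.3, 2.1]
density = activation_density(sample)

# Format as a percentage, in the same style as the dashboard figure.
print(f"{density:.3%}")
```

    Under this reading, a density of 0.042% would correspond to roughly 4 active tokens in every 10,000.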

    No Known Activations