INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     모두
    1.02
    いずれ
    0.94
    Overall
    0.93
    更多
    0.85
    どちら
    0.83
    Two
    0.83
    H
    0.82
    Both
    0.81
     كلها
    0.80
    E
    0.80
    POSITIVE LOGITS
     kinds
    1.70
     sorts
    1.60
    igators
    1.53
    usions
    1.46
    iances
    1.45
    uding
    1.44
     aspects
    1.40
     facets
    1.38
    iteration
    1.35
    lllll
    1.34
    Act Density 0.280%

    No Known Activations