INDEX
    Explanations

    terms that describe physical attributes, movements, or conditions

    Negative Logits
    INGTON    -0.14
    airy    -0.13
    pedia    -0.13
     جÙĦ    -0.13
    ê²    -0.12
    /al    -0.12
    à¸Ļà¸ķ    -0.12
    端    -0.12
    .swap    -0.11
     právÄĽ    -0.11
    Positive Logits
    å¹¹ç·ļ    0.17
    iclass    0.15
    wner    0.14
    ÏģιÏĥ    0.14
    pNet    0.14
    uropean    0.14
    ippi    0.13
    monds    0.13
    ulse    0.13
    neider    0.13
    Activation Density: 0.052%

    No Known Activations