INDEX
    Explanations

    common descriptors and modifiers related to simple concepts or conditions

    New Auto-Interp
    Negative Logits
    StandardItem
    -0.16
    ABCDEFGHIJKLMNOP
    -0.16
    ternet
    -0.16
    anut
    -0.15
    ARB
    -0.15
     Adresse
    -0.14
    TER
    -0.14
    ctl
    -0.14
    ijd
    -0.13
     Byl
    -0.13
    POSITIVE LOGITS
    Ħ
    0.17
    ector
    0.15
    #index
    0.15
    å´İ
    0.14
     Conv
    0.14
     conv
    0.14
    asi
    0.14
     ÑĤой
    0.13
    .runner
    0.13
     Laws
    0.13
    Act Density 0.001%

    No Known Activations