INDEX
    Explanations

    words related to lists or inventories

    Negative Logits
    "rink"       -0.15
    "è³"         -0.15
    "lob"        -0.14
    " blobs"     -0.14
    "rzy"        -0.14
    "upo"        -0.14
    "tra"        -0.14
    "gen"        -0.14
    "beeld"      -0.13
    "rogram"     -0.13

    Positive Logits
    "asca"       0.18
    "oard"       0.16
    "Ĥæķ°"       0.15
    "imitive"    0.15
    "-inline"    0.15
    "ÙĨدÙĩ"      0.15
    "uci"        0.15
    "áli"        0.14
    "uced"       0.14
    ".pix"       0.14
    Activation Density: 0.001%

    No Known Activations