INDEX
    Explanations

    nouns and their associated numeric values in a historical or chronological context

    Negative Logits
    ¦         -0.18
    iva       -0.16
    oster     -0.14
    ãĥ«ãĤ¯    -0.14
    arton     -0.14
    loth      -0.14
    ÏĦιν      -0.14
    apor      -0.14
    dni       -0.14
    ivol      -0.14
    Positive Logits
    igate     0.16
    жи        0.15
     refr     0.15
    acular    0.15
    lean      0.15
    thon      0.14
    _CN       0.14
    iles      0.14
    sole      0.14
    spacer    0.14
    Activation Density: 0.033%

    No Known Activations