INDEX
    Explanations

    phrases emphasizing the concept of significance or importance

    New Auto-Interp
    Negative Logits
     cle
    -0.18
    alent
    -0.17
    PropertyName
    -0.15
    indrical
    -0.15
    aliz
    -0.14
    998
    -0.14
    pedia
    -0.14
    ilim
    -0.14
     Screw
    -0.14
    997
    -0.14
    POSITIVE LOGITS
    uncios
    0.17
    fabric
    0.16
    gateway
    0.14
    ÑĥÑĪка
    0.14
    Ĥ¬
    0.14
    ucz
    0.14
    cole
    0.14
    richt
    0.14
    yun
    0.13
    ekk
    0.13
    Act Density 0.053%

    No Known Activations