INDEX
    Explanations

    verbs and their variations related to visibility or presence

    New Auto-Interp
    Negative Logits
    oner
    -0.19
    ENCHMARK
    -0.15
    zk
    -0.15
    spir
    -0.15
    oning
    -0.14
    ereotype
    -0.14
    imo
    -0.14
    holm
    -0.14
    RNA
    -0.14
    eter
    -0.14
    POSITIVE LOGITS
    antly
    0.24
    ances
    0.24
    /dis
    0.22
    ees
    0.17
    åĭ¢
    0.16
    adox
    0.15
    ndo
    0.15
    ìĥĪ
    0.15
    ance
    0.15
    ANCES
    0.15
    Act Density 0.035%

    No Known Activations