INDEX
    Explanations

    references to literary figures and works

    New Auto-Interp
    Negative Logits
    شة
    -0.16
    лада
    -0.15
    etro
    -0.15
    occo
    -0.14
    ordo
    -0.14
    awi
    -0.13
    odo
    -0.13
     unh
    -0.13
    ldata
    -0.13
     Bieber
    -0.13
    POSITIVE LOGITS
    ův
    0.16
    ibase
    0.15
    athon
    0.15
    rella
    0.15
    Interstitial
    0.15
    ynet
    0.14
    haar
    0.14
    iali
    0.14
     Ends
    0.14
     Extras
    0.14
    Act Density 0.082%

    No Known Activations