INDEX
    Explanations

    historical names and significant figures

    New Auto-Interp
    Negative Logits
    abase
    -0.19
    prite
    -0.16
    OLON
    -0.15
    imore
    -0.15
    IBE
    -0.14
    olon
    -0.14
    бав
    -0.14
    .decorate
    -0.13
    iveau
    -0.13
    cion
    -0.13
    POSITIVE LOGITS
    gem
    0.16
    梨
    0.15
    127
    0.14
    urma
    0.14
    inin
    0.13
    /cop
    0.13
    _advanced
    0.13
     näch
    0.13
    bang
    0.13
     Plant
    0.13
    Act Density 0.035%

    No Known Activations