INDEX
    Explanations

    references to numbers or numerical concepts

    New Auto-Interp
    Negative Logits
    leÅŁik
    -0.17
    ä¸Ī
    -0.15
     kür
    -0.15
    icari
    -0.15
     latter
    -0.14
    åĪļæīį
    -0.14
    radu
    -0.14
    å±Ĭ
    -0.13
    rego
    -0.13
    istique
    -0.13
    POSITIVE LOGITS
     bibli
    0.16
    :
    0.15
    ↵↵
    0.15
     Append
    0.15
     append
    0.15
    âĢĥ
    0.14
     _:
    0.14
    Append
    0.14
     Gutenberg
    0.14
     Conclusion
    0.14
    Act Density 0.054%

    No Known Activations