INDEX
    Explanations

    references to significant entities or themes within a text

    New Auto-Interp
    Negative Logits
    åde
    -0.15
     robin
    -0.15
    978
    -0.14
     Disclosure
    -0.14
    ingles
    -0.14
    Disclosure
    -0.14
    udson
    -0.14
    atables
    -0.14
     Ses
    -0.13
     Math
    -0.13
    POSITIVE LOGITS
    CM
    0.18
    èħ°
    0.16
    kop
    0.15
     prec
    0.15
    CP
    0.15
    anda
    0.14
    heel
    0.14
    адÑĥ
    0.14
     CM
    0.14
    į
    0.14
    Act Density 0.014%

    No Known Activations