INDEX
    Explanations

    numerical representations and references to quantities or figures

    New Auto-Interp
    Negative Logits
    åĬ¡
    -0.16
    iad
    -0.14
    rd
    -0.13
     Chat
    -0.13
    offee
    -0.13
    ton
    -0.13
     Commons
    -0.13
    lla
    -0.13
    -0.13
    kening
    -0.13
    POSITIVE LOGITS
    swire
    0.18
    leground
    0.17
    .gf
    0.16
    ichtig
    0.16
     newcom
    0.16
    ubre
    0.15
    ENCIES
    0.15
    اÙĬر
    0.15
    ajo
    0.15
    ignKey
    0.15
    Act Density 0.042%

    No Known Activations