INDEX
    Explanations

    mentions of different languages and translations

    New Auto-Interp
    Negative Logits
    501
    -0.18
    636
    -0.16
    jes
    -0.15
     Ferry
    -0.15
    abor
    -0.14
    avage
    -0.14
     unix
    -0.14
    064
    -0.14
    bour
    -0.14
    umi
    -0.13
    POSITIVE LOGITS
    lient
    0.17
    ConverterFactory
    0.14
     Sexe
    0.14
     Leer
    0.14
    åħ¶ä¸Ń
    0.13
    ruta
    0.13
     Diss
    0.13
     Worlds
    0.13
    Pow
    0.13
     languages
    0.13
    Act Density 0.021%

    No Known Activations