INDEX
    Explanations

    specific abbreviations and acronyms related to television series and media references

    New Auto-Interp
    Negative Logits
    ndl
    -0.15
    avec
    -0.15
    anford
    -0.15
     desc
    -0.15
    hausen
    -0.14
    .camel
    -0.14
    etas
    -0.14
    äºŃ
    -0.14
    amik
    -0.14
    ainer
    -0.14
    POSITIVE LOGITS
     Newman
    0.15
    Attrib
    0.14
    put
    0.14
     yı
    0.13
    راÙĨÙĩ
    0.13
    rina
    0.13
    Binder
    0.13
    fos
    0.13
    ativa
    0.13
    alsy
    0.13
    Act Density 0.008%

    No Known Activations