INDEX
    Explanations

    specific names, categories, and jargon related to various subjects, particularly in film and scientific contexts

    New Auto-Interp
    Negative Logits
    HasAnnotation
    -0.54
    RTLR
    -0.49
    новниш
    -0.48
     ujednoznacz
    -0.47
     Goodyear
    -0.46
     Laval
    -0.44
     BIBSYS
    -0.43
     Evol
    -0.43
     Klopp
    -0.43
    StoryboardSegue
    -0.43
    POSITIVE LOGITS
     gay
    0.46
     sak
    0.43
    :✨
    0.43
     مرئيه
    0.42
     sake
    0.42
     Gay
    0.41
     lain
    0.41
     GAY
    0.39
    нитар
    0.39
     snippetHide
    0.38
    Act Density 0.185%

    No Known Activations