INDEX
    Explanations

    specific words or phrases related to cultural or artistic subjects, especially in languages other than English

    New Auto-Interp
    Negative Logits
     ÐĹд
    -0.14
    lady
    -0.14
    gger
    -0.13
    bjerg
    -0.13
    é²ľ
    -0.13
    jerne
    -0.13
    FRING
    -0.13
    oyal
    -0.13
    atz
    -0.13
    dden
    -0.13
    POSITIVE LOGITS
     Governors
    0.16
    ãĥ¼ãĥIJ
    0.15
    üsü
    0.14
    uta
    0.14
    /dc
    0.14
    ìĽIJìĿĺ
    0.14
    ilos
    0.13
    uts
    0.13
     (“
    0.13
     Hamp
    0.13
    Act Density 0.095%

    No Known Activations