INDEX
    Explanations

    references to Korean culture and entertainment, particularly relating to food, K-pop, and dramas

    New Auto-Interp
    Negative Logits
    ians
    -0.15
    alus
    -0.15
    spath
    -0.14
    ichel
    -0.14
    endet
    -0.14
     Atkins
    -0.14
    ucc
    -0.14
    anke
    -0.14
     Ùħشار
    -0.14
    ers
    -0.13
    POSITIVE LOGITS
    Õ¡
    0.16
    atown
    0.15
    浩
    0.15
     fisse
    0.14
    jes
    0.14
    abyrinth
    0.14
    bow
    0.14
    jer
    0.14
    å¯Ĵ
    0.14
    æľŁå¾ħ
    0.14
    Act Density 0.036%

    No Known Activations