INDEX
    Explanations

    references to specific popular culture elements or brands

    New Auto-Interp
    Negative Logits
     Lodge
    -0.15
     Howe
    -0.14
    Ĭ
    -0.14
    ies
    -0.14
    ầu
    -0.13
    opsis
    -0.13
    spl
    -0.13
    NSE
    -0.13
     Spl
    -0.13
     Bale
    -0.13
    POSITIVE LOGITS
    ktop
    0.16
    onica
    0.15
    /+
    0.15
    /cop
    0.15
     Guys
    0.14
    swire
    0.14
    dio
    0.14
    igm
    0.14
    IRROR
    0.14
    jab
    0.14
    Act Density 0.058%

    No Known Activations