INDEX
    Explanations

    television programming details and ratings

    Negative Logits
    Token       Logit
     Roy        -0.15
     Sor        -0.15
    ;display    -0.15
     bless      -0.14
     Esta       -0.14
    á»Ĺi        -0.14
    anden       -0.13
    ¼åIJĪ        -0.13
    楽           -0.13
    ł           -0.13

    Positive Logits
    Token       Logit
    ï¼ļ"        0.15
    bah         0.15
    ÃŃsk        0.15
    _REMOTE     0.14
     chaud      0.14
    Tween       0.14
     effort     0.14
    ãĥ¼ãĤ¯      0.14
    ecz         0.14
     Laud       0.13
    Act Density 0.007%

    No Known Activations