INDEX
    Explanations
    NEGATIVE LOGITS
    !)                0.47
    !)                0.46
     craftsmanship    0.40
     surprises        0.37
     وکړئ             0.37
     cheesy           0.34
     香港               0.34
    ~)                0.34
     slowdown         0.34
    !(                0.33
    POSITIVE LOGITS
    ................  1.25
    ...............   1.25
    ............      1.20
    ........          1.19
     ..............   1.18
    .........         1.16
     .........        1.16
    ..............    1.15
    .............     1.15
     ........         1.12
    Act Density 0.016%

    No Known Activations