INDEX
    Explanations

    references to a specific brand or product

    New Auto-Interp
    Negative Logits
    rolls
    -0.15
     Zaman
    -0.15
    cli
    -0.15
    ning
    -0.15
     gi
    -0.14
    oub
    -0.14
    o
    -0.14
    raith
    -0.14
    516
    -0.14
    out
    -0.14
    POSITIVE LOGITS
     fo
    0.32
     Fo
    0.28
    Fo
    0.28
    fo
    0.25
     FO
    0.20
    resh
    0.20
    ibles
    0.20
    obar
    0.19
    isted
    0.19
    aming
    0.17
    Act Density 0.015%

    No Known Activations