INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     geek
    -1.00
    geek
    -0.95
     nerd
    -0.91
     Geek
    -0.90
    Geek
    -0.85
     Nerd
    -0.78
     nerds
    -0.72
     surla
    -0.69
    Nerd
    -0.63
    nerd
    -0.60
    POSITIVE LOGITS
    findpost
    0.69
    market
    0.54
    0.53
    addCriterion
    0.51
    Spoljašnje
    0.50
    xo
    0.49
    hus
    0.48
    lines
    0.48
    іга
    0.47
    </thead>
    0.47
    Act Density 0.009%

    No Known Activations