INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    _tweet
    -0.07
     іншого
    -0.07
     what
    -0.07
     Cors
    -0.07
    what
    -0.07
     Str
    -0.07
    (/^
    -0.06
     ballpark
    -0.06
    شت
    -0.06
     пись
    -0.06
    POSITIVE LOGITS
     Ivan
    0.07
     even
    0.07
     Evan
    0.07
    eny
    0.07
     kunnen
    0.06
    EV
    0.06
     LV
    0.06
    even
    0.06
    TableView
    0.06
    Invalid
    0.06
    Act Density 0.017%

    No Known Activations