INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     unbedingt
    0.64
    页面存档备份
    0.64
    जाने
    0.64
    ført
    0.63
    IBAction
    0.63
     հատ
    0.62
    ://$
    0.61
    Exact
    0.61
    絶対に
    0.60
     repeatability
    0.59
    POSITIVE LOGITS
     welcome
    1.98
     Welcome
    1.78
    Welcome
    1.73
     greetings
    1.73
     hello
    1.58
     greeting
    1.51
    elcome
    1.50
    welcome
    1.50
     Greetings
    1.45
     bienvenue
    1.44
    Act Density 0.325%

    No Known Activations