INDEX
    Explanations

    "absolutely" followed by an emphatic adjective

    Negative Logits
    "ول"            0.59
    "Schedule"      0.53
    " zwią"         0.53
    " زمان"         0.50
    "CAL"           0.50
    "पुढे"          0.50
    "for"           0.48
    " سینٹی"        0.48
    " naps"         0.47
    "Koh"           0.47
    Positive Logits
    " Estas"        0.64
    " considerada"  0.56
    " SELECTED"     0.55
    " Игра"         0.55
    " Revolución"   0.54
    " American"     0.54
    " Playhouse"    0.53
    " harassed"     0.53
    " OrderedDict"  0.53
    " म्हट"         0.52
    Activation Density: 0.002%

    No Known Activations