    Explanations

    modal verbs followed by be/verb

    Negative Logits
     nightlife      1.27
     terrific       1.21
     patriotism     1.21
     carnage        1.19
     murky          1.18
     rambling       1.16
     television     1.16
     dozens         1.14
     spectacular    1.13
     sexuality      1.13
    Positive Logits
     be             1.19
    denoted         0.99
    Hence           0.99
    create          0.97
    not             0.96
     تكون           0.95
     dapat          0.95
     mempunyai      0.95
     يكون           0.94
    could           0.92
    Activation Density: 0.420%

    No Known Activations