INDEX
    Explanations

    descriptive words followed by nouns

    New Auto-Interp
    Negative Logits
     decidedly
    0.38
     admittedly
    0.29
    实际上
    0.29
     ostensibly
    0.29
     عبدالله
    0.28
     schoolchildren
    0.28
    0.27
     unwitting
    0.27
    0.27
    <unused2206>
    0.27
    POSITIVE LOGITS
     equipments
    0.77
     stuffs
    0.71
     evidences
    0.66
     sufferings
    0.59
     advices
    0.59
     feedbacks
    0.57
     appare
    0.55
     functionalities
    0.55
     personnels
    0.54
     logics
    0.53
    Act Density 1.247%

    No Known Activations