INDEX
    Explanations

    phrases indicating perception or observation

    New Auto-Interp
    Negative Logits
    elsen
    -0.60
     Saw
    -0.60
    出版年
    -0.58
    xticks
    -0.58
     JSTOR
    -0.57
     Jacobsen
    -0.57
     SITES
    -0.57
    Wong
    -0.57
     Borgo
    -0.57
     lccn
    -0.57
    POSITIVE LOGITS
    ftagPool
    0.96
     فريبيس
    0.73
     <=",
    0.66
    IsContent
    0.65
    RectangleBorder
    0.65
    BASEPATH
    0.65
    Datuak
    0.62
    MockBean
    0.62
    مراجع
    0.61
     Carlisle
    0.61
    Act Density 0.034%

    No Known Activations