INDEX
    Explanations

    institution names or references to a specific university

    New Auto-Interp
    Negative Logits
    istically
    -0.87
    istical
    -0.79
    ufact
    -0.77
    acular
    -0.76
    raviolet
    -0.75
    merce
    -0.75
    ileaks
    -0.73
    oulder
    -0.71
    anza
    -0.71
    destruct
    -0.69
    POSITIVE LOGITS
     cooker
    1.06
    cloth
    0.90
    washer
    0.87
     Rice
    0.77
     cakes
    0.77
     Kris
    0.77
    cook
    0.76
    bowl
    0.76
    ption
    0.74
     bowls
    0.74
    Act Density 0.026%

    No Known Activations