INDEX
    Explanations

    phrases related to specific entities being known for certain characteristics or actions

    phrases indicating a specific attribute or feature associated with various subjects

    New Auto-Interp
    Negative Logits
    Reviewed
    -0.81
    soever
    -0.75
    OTAL
    -0.75
    MRI
    -0.71
    York
    -0.70
    CNN
    -0.69
    Registered
    -0.66
    down
    -0.65
    Compare
    -0.65
    Correction
    -0.64
    POSITIVE LOGITS
    geries
    0.92
     having
    0.86
     cracking
    0.85
     producing
    0.84
     creating
    0.78
     delivering
    0.77
     being
    0.77
     exporting
    0.76
     discriminating
    0.76
     storing
    0.76
    Act Density 0.088%

    No Known Activations