INDEX
    Explanations

    the word "or" in various contexts

    New Auto-Interp
    Negative Logits
    PDATE
    -0.74
     Candidate
    -0.69
    ETS
    -0.68
    EMP
    -0.67
    INS
    -0.65
     Redmond
    -0.65
    ESE
    -0.65
    ergic
    -0.64
    BLIC
    -0.59
    ONSORED
    -0.58
    POSITIVE LOGITS
    chard
    1.07
    acles
    1.07
    acular
    0.95
    acle
    0.93
    ific
    0.88
    phan
    0.86
    thodox
    0.85
    nam
    0.83
    anges
    0.79
     even
    0.78
    Act Density 0.040%

    No Known Activations