INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    tale
    -0.88
    Relations
    -0.76
     Cosponsors
    -0.75
    κ
    -0.70
    ahn
    -0.67
    steen
    -0.67
    abiding
    -0.66
    reek
    -0.66
    ãĤ©
    -0.66
    relations
    -0.64
    POSITIVE LOGITS
     laptop
    1.09
    aptop
    1.07
     laptops
    1.07
     computers
    0.96
     MacBook
    0.93
     PCs
    0.90
     charger
    0.87
    pad
    0.87
     computer
    0.87
     CPUs
    0.86
    Act Density 0.030%

    No Known Activations