INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    HB
    -0.73
    etheless
    -0.72
    fet
    -0.69
     rainy
    -0.67
     Patrol
    -0.62
    thur
    -0.60
     Dough
    -0.59
     chase
    -0.59
     hiking
    -0.59
    ozo
    -0.59
    POSITIVE LOGITS
    eering
    1.08
    eers
    0.93
     assemb
    0.90
     assemble
    0.83
    semble
    0.82
     assembled
    0.81
    eer
    0.81
     assemblies
    0.78
     assembling
    0.77
    ÃįÃį
    0.77
    Act Density 0.042%

    No Known Activations