INDEX
    Explanations

    phrases indicating contrasts or comparisons

    New Auto-Interp
    Negative Logits
     apprehen
    -1.14
     reluct
    -1.12
     disagre
    -1.11
     disgra
    -1.11
     gaily
    -1.10
     reconno
    -1.02
     vainly
    -1.01
     tolerably
    -0.98
     inconce
    -0.97
     accla
    -0.97
    POSITIVE LOGITS
    теризу
    0.64
     EXPERIMENTS
    0.50
    rozco
    0.50
     abuelos
    0.50
    marginVertical
    0.50
    YOND
    0.49
    tifact
    0.49
     PLATES
    0.48
    0.48
    Voltaje
    0.48
    Act Density 0.310%

    No Known Activations