INDEX
    Explanations

    phrases discussing efficacy and assessments of policies or treatments

    Comes after "while"

    New Auto-Interp
    Negative Logits
     either
    -0.80
    Either
    -0.79
     even
    -0.78
    either
    -0.77
     Either
    -0.77
    even
    -0.71
     prostu
    -0.65
     inoltre
    -0.65
     invece
    -0.65
     entweder
    -0.65
    POSITIVE LOGITS
     technically
    1.31
     nominally
    1.17
     ostensibly
    1.08
     outwardly
    1.03
     admittedly
    1.00
     theoretically
    0.99
     superfic
    0.96
     initially
    0.92
     may
    0.91
     téc
    0.88
    Act Density 0.504%

    No Known Activations