    Explanations

    questions starting with "Is" or "Are"

    Negative Logits
    Token               Logit
    "CloseOperation"    -1.09
    " itſelf"           -0.88
    "AddTagHelper"      -0.84
    " pleaſure"         -0.84
    " Longfellow"       -0.83
    " Ruman"            -0.83
    " Româ"             -0.82
    " themſelves"       -0.81
    " ſever"            -0.81
    "ucoup"             -0.81
    Positive Logits
    Token               Logit
    " Is"               1.12
    "Is"                1.01
    " IS"               0.95
    " is"               0.95
    "mIs"               0.84
    " setIs"            0.83
    " Was"              0.79
    "bIs"               0.78
    " Ishi"             0.78
    "is"                0.78
    Activation Density: 0.170%

    No Known Activations