INDEX
    Explanations

    questions and their corresponding responses or answers

    New Auto-Interp
    Negative Logits
    それでも
    -0.65
     therefore
    -0.58
    writeFieldEnd
    -0.51
     nonetheless
    -0.49
     Therefore
    -0.49
    therefore
    -0.48
     asimismo
    -0.48
     inoltre
    -0.47
    TemporalType
    -0.47
     nevertheless
    -0.47
    POSITIVE LOGITS
    Actually
    1.16
     Nope
    1.10
    Nope
    1.10
     Actually
    1.09
     actually
    1.06
    actually
    1.05
     nope
    0.99
    Depends
    0.93
    Absolutely
    0.91
    nope
    0.90
    Act Density 0.429%

    No Known Activations