INDEX
    Explanations

    phrases expressing curiosity or surprise regarding situations

    explaining why or giving reasons

    Negative Logits
     שוליים
    -0.70
     gyhoeddwyd
    -0.49
     Wiktionnaire
    -0.47
    ècie
    -0.47
    İstinadlar
    -0.47
    دانشنامهٔ
    -0.46
    Sucesor
    -0.44
     testSet
    -0.44
    -0.44
    PostMapping
    -0.42
    Positive Logits
     reason
    0.51
    难怪
    0.50
     why
    0.48
     ragione
    0.46
     razão
    0.46
     razón
    0.45
    latego
    0.44
     therefore
    0.44
    sprechend
    0.43
     reasons
    0.43
    Act Density 0.124%

    No Known Activations