INDEX
    Explanations

    comparative phrases or constructs emphasizing similarity or equality

    New Auto-Interp
    Negative Logits
     Sama
    -0.60
     Dla
    -0.58
     Judah
    -0.58
    olu
    -0.57
    YourGuide
    -0.57
    ","","
    -0.56
     Jackson
    -0.56
     Jacobs
    -0.55
     Gloria
    -0.55
    いません
    -0.54
    POSITIVE LOGITS
    principalTable
    0.79
    原始内容存档于
    0.76
    Tembelea
    0.75
     possible
    0.75
    ashier
    0.75
    InjectAttribute
    0.73
     EconPapers
    0.72
     llamo
    0.72
     Argyle
    0.72
    onAttach
    0.71
    Act Density 0.074%

    No Known Activations