INDEX
    Explanations

    HTML attributes and tags

    New Auto-Interp
    Negative Logits
     doubtnut
    -0.98
     ―――――
    -0.90
     Anſ
    -0.90
     ARXIV
    -0.88
     Theſe
    -0.87
     Monfieur
    -0.87
     Diſ
    -0.85
     (\<
    -0.83
     myſelf
    -0.83
     Jefus
    -0.83
    POSITIVE LOGITS
    ="
    0.86
     "
    0.86
    "
    0.82
    ?
    0.77
    0.73
    )="
    0.72
    ...
    0.72
    endphp
    0.70
     “
    0.69
    ?"
    0.69
    Act Density 0.167%

    No Known Activations