INDEX
    Explanations

    references to scientific discussions and parameters related to energy and physics

    New Auto-Interp
    Negative Logits
    à¹ĥ
    -0.07
    ÃĹ↵↵
    -0.07
    ordes
    -0.06
    ÌĪ
    -0.06
    UNG
    -0.06
    çĹ
    -0.06
    ört
    -0.06
     uch
    -0.06
     Maz
    -0.06
    quo
    -0.06
    POSITIVE LOGITS
     earlier
    0.17
     previously
    0.14
     Earlier
    0.13
     previous
    0.12
     Previously
    0.12
    Earlier
    0.12
     already
    0.12
    Previously
    0.12
    ear
    0.11
    previous
    0.10
    Act Density 0.205%

    No Known Activations