INDEX
    Explanations

    words used when explaining geometric proofs

    New Auto-Interp
    Negative Logits
    estic
    -0.07
    enso
    -0.06
    ạn
    -0.06
    ĥn
    -0.06
    istros
    -0.06
    éry
    -0.06
    isper
    -0.06
    eneric
    -0.06
    ola
    -0.06
    moid
    -0.06
    POSITIVE LOGITS
     vertex
    0.11
     point
    0.09
    vertex
    0.08
     apex
    0.08
    Vertex
    0.07
     shared
    0.07
     source
    0.07
     corner
    0.07
     central
    0.07
    _vertex
    0.07
    Act Density 0.226%

    No Known Activations