INDEX
    Explanations

    terms related to physical objects or activities

    New Auto-Interp
    Negative Logits
    teen
    -0.24
    teenth
    -0.20
    cut
    -0.17
    tes
    -0.17
    fully
    -0.16
    ta
    -0.16
    -ÑĤаки
    -0.16
    amba
    -0.16
     eens
    -0.16
    tery
    -0.15
    POSITIVE LOGITS
    jamin
    0.23
    forth
    0.22
    issance
    0.20
    egal
    0.20
    ultimate
    0.20
    ial
    0.19
    /disable
    0.18
    igma
    0.17
    folk
    0.17
    à§įà¦
    0.16
    Act Density 0.278%

    No Known Activations