INDEX
    Explanations

    conjunctions and phrases indicating connections or inclusivity

    New Auto-Interp
    Negative Logits
    odge
    -0.15
    umont
    -0.15
    iple
    -0.14
    395
    -0.14
    ango
    -0.14
    ou
    -0.14
    endra
    -0.14
    kbd
    -0.13
     ìŀĪê³ł
    -0.13
    ato
    -0.13
    POSITIVE LOGITS
     alike
    0.18
    peater
    0.17
    akov
    0.16
    ãĥģ
    0.15
    eração
    0.15
     Hund
    0.15
    /***************************************************************************↵
    0.15
    icont
    0.15
    finalize
    0.14
    ÑĢаÑĩ
    0.14
    Act Density 0.152%

    No Known Activations