INDEX
    Explanations

    expressions related to historical events and their significance

    New Auto-Interp
    Negative Logits
     Äįi
    -0.15
     Ñĩи
    -0.13
    -&
    -0.13
    phem
    -0.12
    atch
    -0.12
     oder
    -0.12
    abler
    -0.12
    -/
    -0.11
    quam
    -0.11
    _readable
    -0.11
    POSITIVE LOGITS
     and
    0.34
     vÃł
    0.31
    	and
    0.30
     ìŀĪê³ł
    0.29
     и
    0.29
     å¹¶
    0.27
     à¹ģละ
    0.27
    ìĿ´ê³ł
    0.27
    å¹¶
    0.26
     és
    0.24
    Act Density 5.770%

    No Known Activations