INDEX
    Explanations

    proper nouns, especially names and organizations

    New Auto-Interp
    Negative Logits
     اÙĦØ«
    -0.15
    Ỽi
    -0.15
    ória
    -0.15
    oodle
    -0.15
    nton
    -0.15
    xBF
    -0.14
    _DETECT
    -0.14
    unei
    -0.14
    EventListener
    -0.14
    atura
    -0.14
    POSITIVE LOGITS
     SG
    0.23
     AG
    0.22
    tog
    0.21
     Lag
    0.21
    LAG
    0.20
    agog
    0.20
    lg
    0.19
     Kag
    0.19
    AGR
    0.18
     LG
    0.18
    Act Density 0.183%

    No Known Activations