INDEX
    Explanations

    names followed by a verb or description

    Negative Logits
    " heaters"  0.36
    "करण"  0.34
    "ทาน"  0.34
    " 화면"  0.33
    "wpi"  0.33
    " permas"  0.32
    " bgcolor"  0.32
    "Fant"  0.32
    " माता"  0.32
    "으로"  0.31
    Positive Logits
    "ский"  0.43
    " রচিত"  0.42
    " Emeritus"  0.40
    " unveils"  0.40
    " mógł"  0.39
    "ův"  0.39
    "педії"  0.39
    " सचिवालय"  0.39
    "oglu"  0.39
    " shortstop"  0.38
    Activation Density: 0.044%

    No Known Activations