INDEX
    Explanations

    words indicating superiority or high quality

    New Auto-Interp
    Negative Logits
    contentLoaded
    -0.56
    okości
    -0.56
     mark
    -0.56
    IsInitialized
    -0.54
    ]]);
    -0.54
    \}\\
    -0.53
     Healey
    -0.53
    Continent
    -0.52
     Daarnaast
    -0.51
     verke
    -0.51
    POSITIVE LOGITS
     số
    2.45
     sô
    1.39
     Số
    1.33
    Số
    1.14
    0.94
     InputDecoration
    0.79
    AndEndTag
    0.75
    superior
    0.74
    0.74
     SUPERIOR
    0.73
    Act Density 0.061%

    No Known Activations