INDEX
    Explanations

    phrases that indicate significance or importance

    New Auto-Interp
    Negative Logits
     Chwiliwch
    -0.86
    \}\\
    -0.82
     nahilalakip
    -0.73
     חיצוניים
    -0.73
    ^(@)
    -0.69
    </caption>
    -0.68
    %%
    
    -0.68
    contentLoaded
    -0.67
     Савезне
    -0.66
    urrent
    -0.65
    POSITIVE LOGITS
    ificance
    0.98
    Significance
    0.90
     Significance
    0.90
    importance
    0.86
     significance
    0.85
     importance
    0.85
    Importance
    0.81
     Importance
    0.80
     importancia
    0.71
    StandardCharsets
    0.65
    Act Density 0.012%

    No Known Activations