INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     Towers
    -0.08
    .Ab
    -0.08
     finest
    -0.08
     towers
    -0.07
     samengesteld
    -0.07
     werfen
    -0.07
     Strike
    -0.07
     finis
    -0.07
     chcia
    -0.07
     behoren
    -0.07
    POSITIVE LOGITS
     importancia
    0.11
    importance
    0.10
     importância
    0.10
     أهمية
    0.09
    Importance
    0.09
     importance
    0.09
     penting
    0.08
    0.08
     중요
    0.08
    นิยม
    0.08
    Act Density 0.037%

    No Known Activations