INDEX
    Explanations

    Perception and noticing

    New Auto-Interp
    Negative Logits
     jums
    -0.08
     nuk
    -0.08
    <long
    -0.08
     вент
    -0.07
    ethode
    -0.07
     kòm
    -0.07
     whoever
    -0.07
    -0.07
    ogolo
    -0.07
     sèche
    -0.07
    POSITIVE LOGITS
     atentos
    0.10
    0.09
     reconnaissance
    0.09
     novidades
    0.09
     vigilance
    0.09
    0.09
     kansen
    0.09
    0.09
     herkennen
    0.09
     anyar
    0.09
    Act Density 0.015%

    No Known Activations