INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    Vezi
    -0.44
     muñecas
    -0.44
     muñ
    -0.40
     rosca
    -0.39
     vorder
    -0.36
    baikan
    -0.35
     Absch
    -0.34
     oyn
    -0.34
     Bupati
    -0.34
     Distrito
    -0.34
    POSITIVE LOGITS
     energy
    2.11
    Energy
    2.06
     Energy
    2.05
    energy
    2.00
     ENERGY
    1.94
    ENERGY
    1.90
     energia
    1.69
     Energie
    1.65
     energía
    1.60
     énergie
    1.56
    Act Density 0.078%

    No Known Activations