INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     suitable
    -0.07
    buy
    -0.07
    ackage
    -0.06
    ίνη
    -0.06
     достаточно
    -0.06
     Mixer
    -0.06
     stove
    -0.06
    -0.06
    	fclose
    -0.06
    zon
    -0.06
    POSITIVE LOGITS
     empowering
    0.15
     empower
    0.14
     empowered
    0.13
     empowerment
    0.12
     inv
    0.07
     Aw
    0.07
    owers
    0.06
    .GetMapping
    0.06
     Vander
    0.06
    0.06
    Act Density 0.005%

    No Known Activations