INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     outils
    -0.10
     tools
    -0.09
     strumenti
    -0.09
    tools
    -0.08
     Tools
    -0.08
    工具
    -0.08
     dieting
    -0.08
    _tools
    -0.08
     madre
    -0.08
     Sac
    -0.08
    POSITIVE LOGITS
    0.09
    experienced
    0.09
    aday
    0.08
    Fog
    0.08
     gelatin
    0.08
    áló
    0.08
     novela
    0.07
    abilidades
    0.07
    -modern
    0.07
    	GL
    0.07
    Act Density 0.001%

    No Known Activations