INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    tové
    -0.81
    UVEN
    -0.80
     chefs
    -0.78
    -0.77
    rité
    -0.77
    uteria
    -0.77
    omoto
    -0.77
    issue
    -0.77
    uska
    -0.76
     viaje
    -0.76
    POSITIVE LOGITS
     lovers
    3.22
     enthusiasts
    2.94
     lover
    2.70
     fans
    2.53
     enthusiast
    2.50
     aficionados
    2.27
     loving
    1.88
     devotees
    1.85
     Lovers
    1.77
     afic
    1.77
    Act Density 0.061%

    No Known Activations