    Explanations

    expressions of strong admiration or fandom

    Negative Logits
     kasarigan          -0.68
    IntoConstraints     -0.58
     Silla              -0.57
    wußt                -0.55
    Filmographie        -0.55
    RenderAtEndOf       -0.54
     plateado           -0.52
    urlopen             -0.52
     "];                -0.52
     *);                -0.52
    Positive Logits
     lovers             1.07
     lover              1.02
     hobby              0.99
     passion            0.97
     love               0.96
     appassion          0.95
     Lovers             0.94
     fans               0.94
     fan                0.93
     Lover              0.93
    Activation Density: 0.294%

    No Known Activations