INDEX
    Explanations

    phrases that denote excitement and enjoyment related to activities and experiences

    New Auto-Interp
    Negative Logits
    меÑĩ
    -0.18
    WARDS
    -0.16
    ersist
    -0.15
    bilt
    -0.14
    igate
    -0.14
    ÏĩεδÏĮν
    -0.14
    ORIES
    -0.14
     Pron
    -0.14
    опиÑģ
    -0.14
    -Owned
    -0.14
    POSITIVE LOGITS
     Shields
    0.17
     '
    0.16
     rein
    0.15
     proverb
    0.14
    prech
    0.14
     Zimmer
    0.14
     lev
    0.14
    arp
    0.14
    Callable
    0.14
     strictly
    0.14
    Act Density 0.317%

    No Known Activations