INDEX
    Explanations

    words related to contributions and effects on a community or society

    New Auto-Interp
    Negative Logits
    lac
    -0.17
    urger
    -0.14
    æ¯
    -0.14
    cie
    -0.14
    .chomp
    -0.14
    orf
    -0.14
    arp
    -0.14
     telesc
    -0.13
    opa
    -0.13
    oton
    -0.13
    POSITIVE LOGITS
     toward
    0.31
     towards
    0.31
     Towards
    0.23
    Towards
    0.22
     directly
    0.22
     significant
    0.18
     Tow
    0.18
     factors
    0.16
    åIJij
    0.16
     verso
    0.16
    Act Density 0.018%

    No Known Activations