    Negative Logits
    =\"             -0.71
     institution    -0.68
    wings           -0.66
     endeavor       -0.66
     existence      -0.63
     economy        -0.62
    parts           -0.61
    opia            -0.61
     ballpark       -0.61
    vation          -0.61
    Positive Logits
     Jr             1.09
     Raphael        1.04
     Geoff          1.02
     Isabel         0.99
     Gerald         0.99
     Shant          0.98
     Maurice        0.98
     Katherine      0.97
     Richie         0.97
     Samuel         0.97
    Activation Density: 0.109%

    No Known Activations