INDEX
    Explanations

    proper nouns, specifically names of people and places

    New Auto-Interp
    Negative Logits
     infographic
    -0.80
     RELE
    -0.80
     Tablet
    -0.75
     Uni
    -0.74
     IPM
    -0.71
     wholesale
    -0.70
     photoc
    -0.70
     scheduling
    -0.68
     Bott
    -0.68
     footwear
    -0.67
    POSITIVE LOGITS
    anyahu
    0.92
    usky
    0.87
    orf
    0.87
    eden
    0.86
    din
    0.86
    oras
    0.84
    ison
    0.84
    ork
    0.84
    ivas
    0.84
    oval
    0.83
    Act Density 0.325%

    No Known Activations