INDEX
    Explanations

    proper nouns, particularly names and places

    New Auto-Interp
    Negative Logits
    ified
    -0.98
    rified
    -0.95
    lot
    -0.83
    riter
    -0.81
    ifies
    -0.81
    egg
    -0.80
    urgy
    -0.79
    imil
    -0.79
    binding
    -0.78
    blade
    -0.77
    POSITIVE LOGITS
    UAL
    0.87
    elson
    0.73
     Centauri
    0.72
    querade
    0.71
    uality
    0.71
     Gupta
    0.69
    BILITY
    0.69
    OPLE
    0.68
    ñ
    0.67
    agement
    0.67
    Act Density 0.205%

    No Known Activations