INDEX
    Explanations

    proper names or entities, specifically focusing on the name "Ralph"

    the name "Ralph" and its variations in different contexts

    New Auto-Interp
    Negative Logits
    ning
    -0.83
    glers
    -0.81
    ly
    -0.78
    hift
    -0.73
    ners
    -0.70
    strap
    -0.69
    kers
    -0.69
    ned
    -0.66
    lift
    -0.66
    gerald
    -0.63
    POSITIVE LOGITS
    onso
    1.13
    onse
    1.03
    abet
    0.98
     Lauren
    0.90
    inating
    0.83
    oqu
    0.80
     Miliband
    0.79
    abetic
    0.77
     Wald
    0.77
    ieri
    0.77
    Act Density 0.095%

    No Known Activations