INDEX
    Explanations

    phrases related to personal relationships and interactions

    New Auto-Interp
    Negative Logits
     Monfieur
    -0.99
     Chriſt
    -0.94
     purpoſe
    -0.89
     occaf
    -0.88
     greateſt
    -0.82
     ſeveral
    -0.82
     Majefty
    -0.80
     Perſ
    -0.79
     houſe
    -0.77
     Cæsar
    -0.77
    POSITIVE LOGITS
    '):
    
    0.78
    '),
    
    0.76
     Sa
    0.74
     Se
    0.73
    '])
    
    0.73
     rospy
    0.72
     θα
    0.72
    )$_
    0.72
    '))
    
    0.72
    initro
    0.72
    Act Density 0.041%

    No Known Activations