INDEX
    Explanations

    phrases related to political leadership and unity

    Negative Logits
    Token         Logit
    udos          -0.48
    earchers      -0.47
    idav          -0.46
     Variant      -0.45
    urai          -0.45
    bnb           -0.44
     spokesman    -0.43
     à¨           -0.43
    aez           -0.43
    ische         -0.43
    Positive Logits
    Token         Logit
    $.            0.64
    %.            0.64
    }.            0.58
    '.            0.56
    )).           0.55
    .�            0.53
    ]).           0.53
    ]."           0.51
    .''.          0.51
    ".            0.51
    Act Density 8.823%

    No Known Activations