INDEX
    Explanations

    references to political leaders and events

    Follows prepositions or conjunctions

    New Auto-Interp
    Negative Logits
     rep
    -0.54
    .”
    -0.50
    5
    -0.47
    2
    -0.46
     am
    -0.46
     anti
    -0.46
     might
    -0.45
    7
    -0.45
     G
    -0.44
     Is
    -0.44
    POSITIVE LOGITS
    GraphicsUnit
    0.94
    rrggbb
    0.90
     mourut
    0.89
     houſe
    0.89
     étoient
    0.85
     avoient
    0.84
     enfans
    0.81
     pouvoit
    0.81
    IntoConstraints
    0.80
    mybatisplus
    0.79
    Act Density 0.647%

    No Known Activations