INDEX
    Explanations

    specific references to accountability in political contexts

    New Auto-Interp
    Negative Logits
    AddTagHelper
    -0.80
    BeginInit
    -0.62
     myſelf
    -0.61
     quæ
    -0.51
     Partagez
    -0.50
    munk
    -0.49
     themſelves
    -0.48
    parametrize
    -0.48
    
    -0.47
     definitiv
    -0.47
    POSITIVE LOGITS
    InstanceState
    0.64
     blah
    0.63
    Ooh
    0.59
    说不定
    0.57
    ooga
    0.56
     препратки
    0.56
    Oooh
    0.56
     tiens
    0.56
    __*/
    0.55
     przecież
    0.54
    Act Density 0.523%

    No Known Activations