INDEX
Explanations
specific references to accountability in political contexts
New Auto-Interp
Negative Logits
AddTagHelper
-0.80
BeginInit
-0.62
myſelf
-0.61
quæ
-0.51
Partagez
-0.50
munk
-0.49
themſelves
-0.48
parametrize
-0.48
-0.47
definitiv
-0.47
POSITIVE LOGITS
InstanceState
0.64
blah
0.63
Ooh
0.59
说不定
0.57
ooga
0.56
препратки
0.56
Oooh
0.56
tiens
0.56
__*/
0.55
przecież
0.54
Activations Density 0.523%