INDEX
Explanations
references to political figures and their actions or affiliations
Negative Logits
цеп              -0.09
.updateDynamic   -0.09
ョ               -0.08
liž              -0.08
ɵ                -0.08
ört              -0.07
ForRow           -0.07
subjects         -0.07
endum            -0.07
/**\r\n          -0.07
POSITIVE LOGITS
former     0.28
Former     0.24
former     0.23
Former     0.22
býval      0.16
retired    0.15
erst       0.14
formerly   0.13
ex         0.12
سابق       0.11
Activations Density 0.095%