INDEX
Explanations
references to individuals who have held previous positions of power or authority
New Auto-Interp
Negative Logits
IsContent
-0.78
GEBURTSDATUM
-0.74
httphttps
-0.70
ſtre
-0.69
يتيمه
-0.68
faſt
-0.67
KommentareTeilen
-0.65
ſta
-0.63
Reſ
-0.63
ſche
-0.62
POSITIVE LOGITS
former
1.95
Former
1.78
Former
1.77
former
1.63
ehemalige
1.48
ehemaligen
1.46
mantan
1.34
býval
1.28
ehemal
1.24
formerly
1.05
Activations Density 0.232%