INDEX
Explanations
op-ed pieces discussing politics and social issues
New Auto-Interp
Negative Logits
esch
-0.14
alıyor
-0.13
ีว
-0.11
ãĥ©ãĥ¼
-0.11
ÙĨاء
-0.11
hof
-0.11
íķĻ기
-0.11
еÑĢÑĪ
-0.10
LError
-0.10
ียว
-0.10
POSITIVE LOGITS
Ed
1.22
Ed
1.15
ed
1.14
ED
1.10
-ed
1.05
.Ed
1.01
_ed
1.01
.ed
0.99
Edwards
0.93
_ED
0.90
Activations Density 0.294%