Explanations
arguments and discussions related to public policy and social issues
Negative Logits
Token      Logit
âķIJ       -0.73
abouts     -0.71
£ı         -0.67
Selected   -0.67
Latest     -0.67
abo        -0.66
ety        -0.66
ãĥ¯ãĥ³     -0.66
ulhu       -0.66
»Ĵ         -0.64
Positive Logits
Token          Logit
especially     1.08
thereby        1.07
especially     1.04
exacerbate     1.02
impair         1.00
particularly   0.98
albeit         0.97
worsen         0.97
particularly   0.94
exacerb        0.94
Activations Density 0.276%
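
The density figure above is not defined on this page. A minimal sketch of one common reading, assuming it means the fraction of token positions on which the feature's activation is above zero. The function name `activation_density`, the `threshold` parameter, and the sample values are all hypothetical and chosen only for illustration.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumption: "density" is the share of tokens on which the feature fires.
    The dashboard itself does not state its exact definition.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical activations over 1000 token positions; the feature fires on 3.
sample = [0.0] * 1000
sample[10] = 0.4
sample[500] = 1.3
sample[999] = 0.05

density = activation_density(sample)
print(f"{density:.3%}")  # prints "0.300%"
```

Under this reading, a density of 0.276% would mean the feature fires on roughly 276 of every 100,000 tokens in the evaluated text.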