INDEX
Explanations
phrases related to official reports and statements
mention of reports or official findings
New Auto-Interp
Negative Logits
à¹
-0.74
Himself
-0.73
ãĥŀ
-0.72
Ghostbusters
-0.71
IME
-0.68
alogue
-0.64
icio
-0.64
ãĥ¬
-0.62
souven
-0.62
agram
-0.61
POSITIVE LOGITS
Recommend
0.92
findings
0.85
methodological
0.83
systematic
0.81
respondents
0.81
gaps
0.76
ificantly
0.76
statistically
0.75
disparities
0.75
citing
0.72
Activations Density 0.440%