INDEX
Explanations
facts or statistics related to research and surveys
references to statistical findings or estimates related to various topics
New Auto-Interp
Negative Logits
comrade
-0.82
enment
-0.77
yours
-0.74
}}}
-0.70
comrades
-0.70
quished
-0.67
swer
-0.67
tv
-0.66
oleon
-0.66
glory
-0.64
POSITIVE LOGITS
overest
0.97
disproportionately
0.89
"â̦
0.88
clustered
0.86
significantly
0.85
outper
0.85
ificantly
0.84
median
0.84
"[
0.82
underestimated
0.79
Activations Density 0.740%