INDEX
Explanations
quantitative measurements appearing as fractions or percentages
phrases indicating proportions or ratios
New Auto-Interp
Negative Logits
Few
-0.66
andr
-0.66
few
-0.63
Cosponsors
-0.62
Middle
-0.62
too
-0.61
0002
-0.60
among
-0.60
951
-0.59
strong
-0.59
POSITIVE LOGITS
GDP
1.05
total
0.91
what
0.84
capacity
0.79
its
0.74
normal
0.71
maximum
0.71
original
0.68
xon
0.68
their
0.65
Activations Density 0.095%