INDEX
Explanations
phrases related to high levels or rates of something
references to high levels or degrees of various attributes or conditions
New Auto-Interp
Negative Logits
ription
-0.79
tnc
-0.72
herer
-0.71
oute
-0.67
itars
-0.67
ource
-0.64
ocene
-0.63
izon
-0.63
irit
-0.62
vier
-0.62
POSITIVE LOGITS
percentage
1.21
degree
1.15
levels
1.07
percentages
1.06
probability
1.05
incidence
1.04
proportion
1.04
rate
1.03
rates
1.03
level
1.02
Activations Density 0.070%