INDEX
Explanations
prepositions and conjunctions relating to various comparisons or metrics
statistics and numerical values related to sports performances
New Auto-Interp
Negative Logits
aido
-0.79
Rena
-0.70
crime
-0.63
Corrections
-0.63
worn
-0.62
skirts
-0.62
wash
-0.60
hower
-0.56
mature
-0.56
edom
-0.56
POSITIVE LOGITS
clusive
0.77
lihood
0.68
etheless
0.67
--+
0.67
mination
0.67
ven
0.65
lando
0.65
llor
0.65
iversal
0.64
incorpor
0.64
Activations Density 0.048%