INDEX
Explanations
phrases indicating a strong or extreme contrast between different entities or situations
adjectives that indicate intensity or degree of contrast
Negative Logits
irl          -0.69
uden         -0.69
yss          -0.66
yip          -0.65
bandwagon    -0.63
dearly       -0.62
ffen         -0.62
gem          -0.62
iren         -0.61
hirt         -0.60
Positive Logits
increments       1.23
circumstances    1.06
fashion          1.05
quantities       1.01
proportions      1.01
terms            1.00
accordance       0.95
unison           0.93
haste            0.92
intervals        0.91
Activations Density: 0.149%