INDEX
Explanations
phrases related to high levels of something, such as concentration, quality, success, cost, and disparities
phrases indicating high levels of intensity or significant challenges
New Auto-Interp
Negative Logits
udder
-0.79
orthy
-0.71
Alternatively
-0.69
whichever
-0.68
Mb
-0.68
ilaterally
-0.68
Sham
-0.67
SB
-0.66
IER
-0.66
iland
-0.66
POSITIVE LOGITS
nature
1.37
popularity
1.15
ness
1.00
lack
1.00
similarities
0.99
availability
0.98
tendency
0.97
prevalence
0.95
availability
0.95
absence
0.94
Activations Density 0.371%