INDEX
Explanations
phrases indicating a high level of certainty or emphasis
phrases that express a degree of certainty or specificity
Negative Logits
eworks        -0.80
iddles        -0.79
packages      -0.77
###           -0.76
TeX           -0.75
thence        -0.75
usercontent   -0.75
>>>           -0.75
enders        -0.74
••••          -0.72
Positive Logits
amount        1.46
extent        1.12
percentage    1.11
subset        1.07
number        1.06
degree        1.02
ties          0.95
kind          0.94
threshold     0.91
proportion    0.84
Activations Density 0.033%
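
The page does not say how the density figure is computed. A common reading is the fraction of token positions on which the feature fires at all, and the minimal sketch below assumes that definition. The function name, the zero threshold, and the sample data are all hypothetical and are not taken from the dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds `threshold`.

    Assumption: "density" means the share of tokens on which the feature is
    active; the page itself does not define the term.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 10,000 token positions, 3 of which activate the feature.
sample = [0.0] * 9997 + [1.2, 0.4, 2.5]
density = activation_density(sample)
print(f"{density:.3%}")  # prints 0.030%
```

Under this reading, a density of 0.033% would correspond to roughly one active token in every three thousand.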