INDEX
Explanations
phrases related to quantities and proportions
phrases expressing conditionality and comparisons
New Auto-Interp
Negative Logits
INCLUD
-0.64
idon
-0.62
ALWAYS
-0.62
ãĤº
-0.58
ame
-0.58
conflic
-0.57
ESE
-0.57
havoc
-0.55
MUST
-0.55
ãĥī
-0.54
POSITIVE LOGITS
marginally
1.10
briefly
1.09
occasional
1.03
spor
0.98
peripher
0.97
handful
0.96
modest
0.95
minor
0.94
rudimentary
0.93
sporadic
0.93
Activations Density 0.291%