INDEX
Explanations
phrases indicating comparisons or contrasts between different options or features
Comes before descriptions of severity or magnitude
advanced, larger, higher
New Auto-Interp
Negative Logits
Италијани
-0.61
SIMPLE
-0.61
innocent
-0.60
Utilizamos
-0.59
简单
-0.58
VersionUID
-0.57
innocence
-0.56
sederhana
-0.56
простой
-0.56
Билгалдахарш
-0.55
POSITIVE LOGITS
advanced
1.34
larger
1.26
bigger
1.25
higher
1.24
advanced
1.20
Advanced
1.13
Advanced
1.13
Larger
1.10
ADVANCED
1.09
sophisticated
1.08
Activations Density 1.115%