INDEX
Explanations
phrases that indicate conditionality or dependencies
New Auto-Interp
Negative Logits
Horst
-0.67
Huerta
-0.64
centralwidget
-0.64
æa
-0.62
Gauthier
-0.61
पत्र
-0.59
Burch
-0.58
çı
-0.58
peteer
-0.57
깥
-0.57
POSITIVE LOGITS
Depends
1.38
depends
1.36
depending
1.30
depends
1.29
depend
1.29
Depends
1.25
depended
1.25
depend
1.23
depending
1.20
depende
1.11
Activations Density 0.128%