INDEX
Explanations
phrases that express preference or comparison
New Auto-Interp
Negative Logits
ToBounds
-0.87
Vij
-0.71
Williamson
-0.66
uuidv
-0.66
regla
-0.65
ActivityCompat
-0.65
labelledby
-0.64
Lue
-0.63
Hermans
-0.63
зу
-0.62
POSITIVE LOGITS
uttosto
1.24
Rather
1.06
rather
1.05
Rather
1.03
rather
0.89
***/
0.88
guère
0.82
Bemer
0.82
Fairly
0.81
PLWABN
0.81
Activations Density 0.083%