INDEX
Explanations
comparisons using the word "than"
comparative phrases or expressions emphasizing "than."
Negative Logits
ModLoader     -0.81
Juda          -0.74
Ire           -0.72
ilic          -0.69
Winged        -0.67
exemptions    -0.65
derog         -0.64
aird          -0.64
Contract      -0.63
enser         -0.63
Positive Logits
atos           1.24
lihood         0.97
assis          0.83
xual           0.74
pload          0.73
ply            0.71
acles          0.70
itars          0.69
gs             0.69
tz             0.68
Activations Density: 0.028%