Explanations
comparative phrases indicating a lesser amount or degree
comparative phrases indicating relationships or differences in terms of intensity or degree
Negative Logits
ALE       -0.71
obal      -0.70
bilt      -0.69
erman     -0.66
imity     -0.64
raine     -0.64
ISSION    -0.62
rio       -0.62
Juda      -0.62
itia      -0.61
Positive Logits
usual        1.39
anticipated  1.00
usual        1.00
expected     0.97
anything     0.93
ever         0.85
typical      0.84
average      0.77
advertised   0.77
any          0.76
Activations Density: 0.089%