Explanations
phrases indicating beliefs or statements of fact
phrases expressing skepticism or disbelief
Negative Logits
vette          -0.70
ouf            -0.67
arrivals       -0.66
åº             -0.64
WARE           -0.64
ENCY           -0.63
ADVERTISEMENT  -0.60
peria          -0.60
opter          -0.59
ratulations    -0.59
Positive Logits
Matrix              0.71
loo                 0.68
ById                0.68
alone               0.67
isSpecialOrderable  0.66
ohm                 0.65
unes                0.65
washer              0.64
chy                 0.63
raining             0.62
Activations Density 0.440%