INDEX
Explanations
phrases indicating contrast or comparison
phrases that emphasize exclusivity or limitation
New Auto-Interp
Negative Logits
»Ĵ
-0.93
EStream
-0.68
³
-0.67
etc
-0.66
ĸ
-0.65
UGE
-0.62
ij士
-0.61
glomer
-0.61
Various
-0.60
nect
-0.60
POSITIVE LOGITS
oped
0.75
physically
0.69
tem
0.65
schild
0.63
ional
0.63
survive
0.62
obe
0.61
ifiable
0.60
visually
0.59
financially
0.58
Activations Density 0.038%