INDEX
Explanations
phrases indicating a comparison or contrast
phrases that use the word "as" in various contexts
Negative Logits
å½       -0.70
uin      -0.65
oes      -0.64
ãĥ¥      -0.63
UE       -0.63
WI       -0.62
oe       -0.62
TAIN     -0.60
======   -0.60
POST     -0.60
Positive Logits
pired    0.98
ylum     0.97
bestos   0.92
leep     0.84
ynchron  0.82
phalt    0.81
part     0.77
ocial    0.74
ifles    0.74
pires    0.73
Activations Density: 0.062%