INDEX
Explanations
conditional phrases and questions
Text following periods
transition words
New Auto-Interp
Negative Logits
nahilalakip
-0.71
)";
-0.68
PhysRevLett
-0.68
%");
-0.66
Kitch
-0.65
()");
-0.64
tvguidetime
-0.63
¦
-0.62
nui
-0.60
//}
-0.59
POSITIVE LOGITS
Anyway
0.93
Btw
0.91
Anyway
0.83
Nowadays
0.82
nowadays
0.79
Moreover
0.76
Kindly
0.75
btw
0.74
Apart
0.74
Beside
0.73
Activations Density 0.447%