INDEX
Explanations
phrases related to rules, regulations, and arguments
conjunctions and transitional phrases in discourse
Negative Logits
Gleaming              -0.80
ORY                   -0.76
Heads                 -0.74
RAD                   -0.72
iaries                -0.70
ORPG                  -0.69
guiActiveUnfocused    -0.68
ories                 -0.67
覚醒                  -0.66
ゴン                  -0.66
Positive Logits
feas                  0.92
alas                  0.86
dare                  0.85
somew                 0.84
lege                  0.81
necess                0.81
rouse                 0.80
theoretically         0.79
someday               0.78
indeed                0.77
Activations Density: 0.095%