Explanations
phrases referring to time or timeline events
instances of the word "now."
Negative Logits
SourceFile    -0.89
ãĥ¥           -0.73
è¦ļéĨĴ        -0.71
Cause         -0.66
portfolio     -0.64
ļéĨĴ          -0.64
portfolios    -0.64
alogy         -0.64
conom         -0.60
Deal          -0.59
Positive Logits
however       0.95
alas          0.80
we            0.79
adays         0.78
THERE         0.77
despite       0.77
thankfully    0.77
according     0.76
somew         0.72
although      0.72
Activations Density 0.131%