INDEX
Explanations
punctuation marks and sentence boundaries
New Auto-Interp
Negative Logits
kefeller
-0.83
democrat
-0.74
sche
-0.73
challeng
-0.72
sustainable
-0.71
diseng
-0.71
insur
-0.71
affili
-0.70
prey
-0.66
conscience
-0.66
POSITIVE LOGITS
Initially
1.17
Its
1.14
However
1.13
Previously
1.08
Normally
1.05
It
1.04
Firstly
1.03
Each
1.03
Although
1.03
Though
1.03
Activations Density 0.549%