INDEX
Explanations
technical markers within the text, possibly indicative of formatting or metadata
numeric values and their associated contexts
New Auto-Interp
Negative Logits
favor
-0.76
favorably
-0.74
purs
-0.74
hovah
-0.71
©¶æ
-0.71
ĪĴ
-0.68
honors
-0.68
behaviors
-0.68
solic
-0.67
favored
-0.67
POSITIVE LOGITS
However
1.25
Meanwhile
1.13
Topics
1.11
Speaking
1.09
Writing
1.08
But
1.07
Instead
1.06
Read
1.05
Having
1.04
Asked
1.04
Activations Density 0.490%