INDEX
Explanations
punctuation and sentence structure elements in the text
Negative Logits
ogonal         -0.15
μί             -0.15
totalCount     -0.15
cope           -0.14
quare          -0.14
909            -0.14
']!='          -0.13
vap            -0.13
Declare        -0.13
RuleContext    -0.13
Positive Logits
think          0.57
Think          0.54
think          0.52
Think          0.52
consider       0.45
imagine        0.43
look           0.42
Imagine        0.38
Imagine        0.36
THINK          0.35
Activations Density: 0.256%