INDEX
Explanations
links or prompts directing to additional information or sources
instances of the word "here" as a call to action or to direct readers to additional content
Negative Logits
Ͻ              -0.88
manif          -0.81
overtime       -0.80
onga           -0.79
awakening      -0.78
footing        -0.78
¶ħ             -0.76
untarily       -0.73
©¶æ¥µ          -0.72
involuntary    -0.72
POSITIVE LOGITS
<|endoftext|>    1.05
aspx             1.02
Thanks           1.02
Also             1.00
Though           0.96
Subscribe        0.95
Please           0.95
Additionally     0.94
Alternatively    0.92
Retrieved        0.92
Activations Density: 0.419%