INDEX
Explanations
numbers and symbols at the end of a text segment
instances of articles and references to ongoing stories or important issues
Negative Logits
Token          Logit
revol          -0.82
challeng       -0.82
rhy            -0.81
unstoppable    -0.78
enthusi        -0.78
tremend        -0.78
democrat       -0.78
¥ŀ             -0.77
paddle         -0.77
enriched       -0.77
Positive Logits
Token          Logit
But            1.38
Experts        1.36
The            1.34
According      1.33
However        1.32
Asked          1.31
Officials      1.30
While          1.29
It             1.28
Among          1.28
Activations Density: 0.132%