INDEX
Explanations
mentions of specific events or incidents
occurrences of punctuation marks, particularly periods, indicating the end of sentences
New Auto-Interp
Negative Logits
tremend
-0.95
subsidized
-0.71
harbor
-0.70
gobl
-0.67
oggles
-0.66
corrid
-0.65
leveled
-0.65
honoring
-0.64
ãĤ¼ãĤ¦ãĤ¹
-0.64
stabilization
-0.64
POSITIVE LOGITS
↵
1.30
<|endoftext|>
1.04
®
1.03
However
0.96
Pict
0.95
Whilst
0.94
↵↵
0.91
Alternatively
0.86
Shape
0.85
Ministers
0.84
Activations Density 0.297%