Explanations
phrases related to news headlines
instances of dashes or other symbols indicating breaks in text
Negative Logits
wagen             -0.79
spir              -0.65
deals             -0.64
interstitial      -0.64
servicing         -0.63
bour              -0.62
stones            -0.62
agate             -0.61
heric             -0.61
range             -0.61
Positive Logits
Comments          0.86
Advertisement     0.81
=-=-=-=-          0.73
––                0.73
Transcript        0.71
ADVERTISEMENT     0.70
Fever             0.69
=-=-=-=-=-=-=-=-  0.68
Edited            0.68
ĸļ                0.66
Activations Density: 0.049%