INDEX
Explanations
statements or articles beginning with "This."
occurrences of the word "This" and variations related to current events or updates
New Auto-Interp
Negative Logits
vae
-0.85
worms
-0.77
icons
-0.74
nets
-0.71
papers
-0.69
doms
-0.68
tops
-0.65
ickets
-0.64
rices
-0.64
imal
-0.63
POSITIVE LOGITS
week
1.15
month
1.02
year
0.99
latest
0.92
weekend
0.92
morning
0.90
afternoon
0.86
WEEK
0.83
century
0.81
milestone
0.81
Activations Density 0.228%