INDEX
Explanations
dates or timestamps from online posts or publications
instances of the word "Posted" followed by numbers, indicating dates and times
New Auto-Interp
Negative Logits
atern
-0.81
fell
-0.81
isky
-0.80
ugal
-0.75
ishi
-0.73
oshenko
-0.73
aternity
-0.72
ppard
-0.72
ivo
-0.71
$$$$
-0.71
POSITIVE LOGITS
Posted
0.83
erick
0.75
Thumbnails
0.73
monton
0.68
Comments
0.68
itors
0.66
Kiw
0.64
Parade
0.64
vertis
0.64
Sun
0.63
Activations Density 0.010%