INDEX
Explanations
phrases related to news articles or reports, specifically those with dates and sources
occurrences of date and location references
New Auto-Interp
Negative Logits
prol
-0.61
next
-0.60
worms
-0.60
fame
-0.60
attent
-0.58
conversion
-0.58
onics
-0.58
Spartans
-0.57
Sixth
-0.57
brakes
-0.57
POSITIVE LOGITS
Jonathan
1.05
String
1.03
Jason
0.99
David
0.99
Mike
0.98
Carl
0.97
Joshua
0.95
Jim
0.95
Luc
0.92
Kevin
0.92
Activations Density 0.021%