INDEX
Explanations
phrases related to news headlines or current events
significant predictive statements about potential events or outcomes
New Auto-Interp
Negative Logits
_.
-0.81
example
-0.77
76561
-0.76
assum
-0.76
them
-0.76
thereof
-0.72
Niet
-0.72
âĶĢâĶĢâĶĢâĶĢ
-0.71
ãĢĤ
-0.71
thing
-0.70
POSITIVE LOGITS
honoured
0.76
watchdog
0.75
declass
0.75
Wednesday
0.75
Thursday
0.72
unveiled
0.70
cybersecurity
0.68
apologised
0.68
renewed
0.68
Tuesday
0.68
Activations Density 0.523%