INDEX
Explanations
updates or revisions within a text
instances of updates or announcements related to events or statuses
New Auto-Interp
Negative Logits
athered
-0.78
ographies
-0.69
stood
-0.68
anship
-0.67
sbm
-0.66
uay
-0.65
isable
-0.62
vast
-0.61
userc
-0.61
cific
-0.60
POSITIVE LOGITS
:
1.01
Feb
0.98
Sept
0.94
02
0.92
July
0.92
07
0.92
04
0.91
06
0.91
June
0.90
Aug
0.90
Activations Density 0.027%