INDEX
Explanations
references to time periods, specifically the past year, and occasionally to shorter time frames
references to time periods, particularly years and decades
New Auto-Interp
Negative Logits
Saras
-0.62
Clause
-0.60
phr
-0.60
Hearts
-0.60
Fram
-0.60
breached
-0.58
breaching
-0.58
ogle
-0.57
sarc
-0.56
Selection
-0.56
POSITIVE LOGITS
long
0.85
frames
0.85
teenth
0.81
mill
0.75
eteenth
0.75
orate
0.74
SPONSORED
0.72
fuck
0.72
share
0.71
Mill
0.70
Activations Density 0.080%