INDEX
Explanations
time-related expressions such as days, weeks, years, and months with a numeric value preceding them
temporal phrases indicating the passage of time
New Auto-Interp
Negative Logits
Flavoring
-0.71
podium
-0.66
Reviewer
-0.65
behavi
-0.62
gestures
-0.62
anus
-0.62
pronounce
-0.60
derog
-0.59
teammate
-0.58
Pact
-0.58
POSITIVE LOGITS
adobe
0.83
long
0.80
lot
0.76
days
0.75
frames
0.74
surrounding
0.72
liest
0.72
icter
0.68
preceding
0.68
lite
0.67
Activations Density 0.093%