INDEX
Explanations
time-related expressions, specifically durations or timestamps
numerical references related to time and duration
New Auto-Interp
Negative Logits
onductor
-0.70
srf
-0.69
ufact
-0.64
Caption
-0.64
icter
-0.62
urden
-0.61
underside
-0.60
Prev
-0.58
Information
-0.58
depiction
-0.57
POSITIVE LOGITS
bucks
1.43
guys
1.10
minutes
1.08
dudes
1.08
dollars
1.08
thousand
1.08
goddamn
1.07
fuckin
1.04
cents
1.02
mins
1.01
Activations Density 0.242%