INDEX
Explanations
references to urgency or time-related concepts, particularly involving the term "rush."
New Auto-Interp
Negative Logits
ija
-0.17
izabeth
-0.16
achel
-0.16
жд
-0.16
pecific
-0.16
ht
-0.15
zcze
-0.15
hani
-0.15
opers
-0.14
agues
-0.14
POSITIVE LOGITS
-hour
0.22
die
0.22
hour
0.21
Rush
0.18
hour
0.18
Lim
0.18
rush
0.17
Hour
0.17
rod
0.16
rushes
0.16
Activations Density 0.020%