INDEX
Explanations
references to time, particularly in the form of "ago"
Negative Logits
dominant    -0.64
'.$         -0.61
Sa          -0.58
"").        -0.58
dominant    -0.57
']}         -0.54
OR          -0.54
current     -0.54
điều        -0.52
F           -0.51
Positive Logits
ago         3.93
AGO         1.96
Ago         1.90
geleden     1.85
Ago         1.66
ago         1.64
назад       1.48
年前        1.28
AGO         1.27
atrás       1.21
Activations Density 0.031%