INDEX
Explanations
statements regarding clarity and uncertainty in contexts, particularly related to future events or outcomes
New Auto-Interp
Negative Logits
seless
-0.63
pearl
-0.63
Godd
-0.62
misogyn
-0.60
odied
-0.59
Femin
-0.59
femin
-0.58
discriminated
-0.57
blasphemy
-0.57
superiority
-0.57
POSITIVE LOGITS
timetable
1.08
deadlines
0.90
tentative
0.89
deadline
0.88
2019
0.86
timeframe
0.82
TBD
0.82
uncertainties
0.81
roadmap
0.80
TBA
0.80
Activations Density 1.437%