INDEX
Explanations
expressions indicating beliefs or opinions
phrases indicating belief or confidence in outcomes
New Auto-Interp
Negative Logits
disclosure
-0.80
disclosures
-0.75
testified
-0.69
deduction
-0.68
protested
-0.67
Occupations
-0.67
Brune
-0.67
mentions
-0.67
complains
-0.67
prohibitions
-0.67
POSITIVE LOGITS
destined
1.07
ready
1.06
poised
1.06
Ready
0.96
viable
0.95
unbeat
0.94
unstoppable
0.94
primed
0.91
achievable
0.91
ready
0.90
Activations Density 0.654%