INDEX
Explanations
phrases or terms related to decision-making and announcements
New Auto-Interp
Negative Logits
instead
-0.74
rall
-0.68
rather
-0.68
rather
-0.66
stayed
-0.65
instead
-0.62
pill
-0.62
impro
-0.61
seldom
-0.61
gul
-0.58
POSITIVE LOGITS
specifics
0.94
officially
0.90
nor
0.87
definitively
0.85
formally
0.82
satisf
0.78
definitive
0.78
yet
0.77
conclusive
0.75
finalized
0.74
Activations Density 0.113%