INDEX
Explanations
explicit statements of clarification or emphasis
phrases indicating clarity or explicitness in statements
New Auto-Interp
Negative Logits
luck
-0.72
ickle
-0.67
Luck
-0.65
luck
-0.65
average
-0.65
Faul
-0.64
digy
-0.63
Derby
-0.60
crane
-0.59
Luck
-0.58
POSITIVE LOGITS
displeasure
0.92
unequivocally
0.87
emphatically
0.86
intention
0.85
disapproval
0.83
willingness
0.83
intentions
0.80
ariat
0.80
commitment
0.79
disdain
0.79
Activations Density 0.201%