INDEX
Explanations
the word "assume" at varying strength levels
the word "assume" and its variations in the context of making assumptions
New Auto-Interp
Negative Logits
Rebels
-0.64
stre
-0.61
Comm
-0.59
hod
-0.58
worthy
-0.58
Christmas
-0.58
zan
-0.57
reson
-0.57
Bee
-0.57
Wars
-0.56
POSITIVE LOGITS
assume
3.73
assumes
2.50
presume
2.43
assumed
2.12
assuming
1.97
suppose
1.74
assum
1.72
assumption
1.69
imply
1.48
conclude
1.44
Activations Density 0.013%