INDEX
Explanations
phrases containing the term "fold"
phrases that describe complexity and clarity in explanations or arguments
New Auto-Interp
Negative Logits
Instr
-0.74
renovations
-0.70
broom
-0.66
shuttle
-0.60
disbanded
-0.58
manned
-0.58
Rolls
-0.58
wardrobe
-0.56
yles
-0.56
refurb
-0.56
POSITIVE LOGITS
:[
0.97
.
0.96
:
0.92
.):
0.92
because
0.91
insofar
0.89
*:
0.87
considering
0.87
empir
0.83
:-
0.83
Activations Density 0.290%