INDEX
Explanations
instances where something is being limited, furthered, or only allowed to a certain extent
words that indicate limitations or constraints
New Auto-Interp
Negative Logits
)|
-0.64
âĿ
-0.61
},{"-0.61
},"
-0.60
ilege
-0.60
oir
-0.60
Base
-0.59
oko
-0.57
onomy
-0.57
Hold
-0.56
POSITIVE LOGITS
preferring
1.15
suggesting
1.05
implying
0.97
noting
0.97
adding
0.93
culminating
0.92
spilling
0.90
emphasizing
0.89
prompting
0.89
echoing
0.86
Activations Density 0.421%