INDEX
Explanations
phrases that indicate ambiguity or vagueness
New Auto-Interp
Negative Logits
enberg
-0.06
unal
-0.06
enal
-0.06
mell
-0.06
มà¸Ń
-0.06
Watt
-0.06
illon
-0.06
dikke
-0.06
ourg
-0.05
:Boolean
-0.05
POSITIVE LOGITS
vague
0.14
vag
0.13
broad
0.12
general
0.11
generic
0.10
-general
0.10
generic
0.10
specificity
0.10
konkrét
0.10
general
0.10
Activations Density 0.074%