INDEX
Explanations
indications of a reduction or decrease in value or amount
the phrase "Less than" in various contexts
New Auto-Interp
Negative Logits
âĹ¼
-0.76
TAG
-0.65
ãĤ±
-0.63
DK
-0.62
supremacy
-0.61
Reconstruction
-0.60
Draft
-0.59
mberg
-0.58
Origins
-0.57
TRY
-0.57
POSITIVE LOGITS
ened
1.00
than
0.88
ening
0.84
reet
0.81
ensive
0.81
thumbnails
0.80
ons
0.80
erton
0.79
cipl
0.75
onest
0.74
Activations Density 0.039%