INDEX
Explanations
the word "Less" followed by a number
phrases indicating a low quantity or cost
New Auto-Interp
Negative Logits
ãĤ±
-0.67
uu
-0.62
transporter
-0.59
submitting
-0.58
este
-0.58
oco
-0.57
anka
-0.57
tricked
-0.56
smashing
-0.55
deceive
-0.55
POSITIVE LOGITS
Less
3.45
Less
3.01
Few
1.38
less
1.36
More
1.35
Greater
1.34
Lessons
1.33
lesser
1.24
LESS
1.16
Reduced
1.15
Activations Density 0.007%