INDEX
Explanations
phrases indicating a degree of certainty or comparison
phrases indicating degrees of uncertainty or ambiguity
New Auto-Interp
Negative Logits
Puzz
-0.62
LINE
-0.60
Ratings
-0.60
ETS
-0.60
edia
-0.58
Pony
-0.58
ocrats
-0.58
Novel
-0.56
Cipher
-0.56
sheets
-0.55
POSITIVE LOGITS
nam
1.25
chard
1.25
nery
1.17
lando
1.12
acle
1.08
acles
1.05
chid
1.04
leans
1.03
acular
1.02
phan
1.00
Activations Density 0.068%