INDEX
Explanations
adjectives that describe the complexity or clarity of a concept
terms that indicate simplicity or clarity in complex contexts
New Auto-Interp
Negative Logits
deen
-0.62
thood
-0.60
ioxide
-0.60
position
-0.60
agine
-0.60
tein
-0.59
assies
-0.56
Statue
-0.56
ench
-0.54
Ķ
-0.53
POSITIVE LOGITS
enough
1.21
insofar
0.98
nonetheless
0.98
enough
0.87
because
0.81
but
0.81
indeed
0.80
nevertheless
0.79
;
0.79
despite
0.76
Activations Density 0.388%