Explanations
comparative adjectives related to quality
Negative Logits
andise      -0.94
ervative    -0.73
aint        -0.68
ADRA        -0.68
inosaur     -0.65
aukee       -0.63
CLE         -0.63
SU          -0.62
OGR         -0.61
itutional   -0.60
Positive Logits
(<          0.98
ered        0.85
enthal      0.82
downs       0.79
est         0.79
case        0.78
down        0.76
trickle     0.75
observable  0.75
ppy         0.74
Activations Density: 0.686%