INDEX
Explanations
adjectives and adverbs indicating quality or effectiveness
Negative Logits
increments      -0.17
Slinky          -0.16
{{--<           -0.15
ivec            -0.15
erotique        -0.14
addCriterion    -0.14
.FC             -0.14
orang           -0.14
amac            -0.14
.fc             -0.14
Positive Logits
ast         0.24
has         0.22
has         0.21
ad          0.20
quanto      0.19
than        0.19
s           0.18
anybody     0.18
ae          0.17
HAS         0.17
Activations Density 0.044%