Explanations
adverse descriptors indicating negativity or poor quality
Negative Logits
ee          -0.16
eb          -0.15
endon       -0.15
dale        -0.15
_maximum    -0.15
cean        -0.14
inz         -0.14
ordial      -0.14
orr         -0.14
ová         -0.14
Positive Logits
ger         0.34
gers        0.25
-news       0.24
dest        0.23
luck        0.22
ging        0.22
lands       0.20
GER         0.20
ged         0.19
ges         0.19
Activations Density: 0.031%