Explanations
positive adjectives and descriptors indicating high quality or excellence
positive quality descriptors
Negative Logits
berdayakan    -0.33
umum          -0.31
perbuatan     -0.30
Ganzen        -0.30
itself        -0.28
kantoor       -0.28
voorkomen     -0.27
may           -0.27
oorsp         -0.27
time          -0.27
Positive Logits
IndentedString    0.90
featureID         0.88
المعيارى          0.87
Vidite            0.86
disponibilités    0.79
                  0.75
<unused41>        0.74
<unused47>        0.74
<unused23>        0.74
<unused8>         0.74
Activations Density: 0.014%