Explanations
Negative phrases and descriptions that express perceptions of, or judgments about, people or situations
Negative Logits
réception        -0.66
navidad          -0.54
veu              -0.54
difíciles        -0.52
dewasa           -0.52
taget            -0.52
(no token text)  -0.51
razón            -0.51
recepción        -0.51
extérieur        -0.50
Positive Logits
rawDesc          1.11
pure             1.02
HasAnnotation    1.01
✨:              0.97
Tikang           0.94
cyklopedia       0.88
Portale          0.87
AnchorStyles     0.87
setViewportView  0.87
PreferredItem    0.85
Activations Density 0.625%