INDEX
Explanations
occurrences of strong emotional responses or opinions
New Auto-Interp
Negative Logits
еÑģÑı
-0.15
Chapman
-0.14
ymous
-0.14
yd
-0.14
å®
-0.14
oks
-0.13
otte
-0.13
aries
-0.13
arious
-0.13
anner
-0.13
POSITIVE LOGITS
"
0.16
'al
0.16
rete
0.15
/renderer
0.14
'
0.14
_COMPAT
0.14
Ìģ
0.14
\Desktop
0.14
UES
0.13
ï¼ļ"
0.13
Activations Density 0.831%