INDEX
Explanations
statements reflecting personal opinions or subjective viewpoints
Negative Logits
ermann    -0.15
horror    -0.14
grav      -0.14
eneral    -0.14
owell     -0.14
gone      -0.14
erman     -0.14
æ¿        -0.13
SSION     -0.13
MC        -0.13
Positive Logits
smart     0.27
SMART     0.25
smart     0.24
wise      0.23
Smart     0.23
wisdom    0.22
Wise      0.22
.smart    0.21
èģ        0.21
_smart    0.21
Activations Density 0.011%