INDEX
Explanations
statements expressing opinions or positions on political or social issues
Negative Logits
!'          -0.62
sqor        -0.60
Byte        -0.58
Illust      -0.57
llular      -0.56
Streets     -0.56
Entry       -0.56
bask        -0.56
gasp        -0.55
Bunny       -0.55
Positive Logits
"[          1.22
"…          1.13
"'          1.12
"...        1.11
"           1.10
"#          0.99
regretted   0.94
''          0.92
"(          0.91
""          0.85
Activations Density 7.541%