INDEX
Explanations
phrases related to diverse opinions, freedom of expression, and open discourse
discussions about political opinions and the right to express them
New Auto-Interp
Negative Logits
©¶æ¥µ
-0.75
mallow
-0.75
Storage
-0.72
Nurs
-0.71
Married
-0.71
wastewater
-0.70
Rehab
-0.70
kidnapping
-0.70
Magicka
-0.70
aceae
-0.69
POSITIVE LOGITS
viewpoints
1.67
opinions
1.67
opin
1.43
opinion
1.40
truths
1.34
dissenting
1.33
disagree
1.33
views
1.30
disagrees
1.30
conclusions
1.29
Activations Density 0.659%