INDEX
Explanations
phrases related to political and social issues
New Auto-Interp
Negative Logits
effic
-0.84
diseases
-0.73
advant
-0.72
efficiency
-0.71
laure
-0.69
habitat
-0.69
Columb
-0.69
competition
-0.68
shortages
-0.68
ahime
-0.68
POSITIVE LOGITS
excerpts
1.17
reprinted
1.14
itled
1.12
published
1.07
excerpt
1.03
redacted
1.03
leaked
1.00
headlined
0.98
circulated
0.98
edited
0.96
Activations Density 15.079%