INDEX
Explanations
instances of mentions of issues or problems
phrases that indicate complex societal or ethical issues
Negative Logits
ournal     -0.80
ricanes    -0.79
thora      -0.78
gres       -0.77
undreds    -0.75
tsky       -0.75
uctions    -0.74
ousands    -0.74
ques       -0.74
aults      -0.73
Positive Logits
albeit      0.87
insofar     0.83
hence       0.79
albeit      0.78
therefore   0.77
whereby     0.77
regardless  0.70
although    0.69
unlike      0.68
wherein     0.68
Activations Density 0.688%