INDEX
Explanations
phrases and keywords related to societal issues and community dynamics
New Auto-Interp
Negative Logits
,)
-0.33
.)
-0.28
###
-0.27
/*↵
-0.27
.")
-0.26
###
-0.26
.)↵
-0.25
,)↵
-0.24
/*↵
-0.24
.')
-0.24
POSITIVE LOGITS
".↵
0.32
".↵↵
0.29
.hpp
0.29
”.↵
0.28
_HPP
0.28
”.↵↵
0.27
".↵
0.26
)".
0.26
".
0.25
ÙijÙİ
0.25
Activations Density 0.093%