INDEX
Explanations
phrases related to war and suffering
instances of hyphenated phrases or terms
Negative Logits
opher       -0.69
oshop       -0.67
preference  -0.67
åŃIJ         -0.65
ously       -0.62
arena       -0.61
constitu    -0.61
umbers      -0.61
newcomer    -0.61
zbek        -0.60
Positive Logits
_-                 1.67
webkit             1.02
=-=-=-=-           0.86
âĢ¢âĢ¢             0.85
=-=-=-=-=-=-=-=-   0.83
taboola            0.78
/-                 0.78
something          0.75
including          0.74
âĸº                0.74
Activations Density: 0.069%