INDEX
Explanations
references to peace activists and related concepts
New Auto-Interp
Negative Logits
DebuggerStep
-0.48
nhật
-0.46
>{@-0.46
теристики
-0.44
propOrder
-0.43
Picchu
-0.41
للمعارف
-0.41
guapa
-0.41
Слу
-0.40
脚注の使い方
-0.39
POSITIVE LOGITS
violence
0.72
violence
0.66
Violence
0.63
Violence
0.63
peace
0.57
conflict
0.56
peace
0.56
violent
0.55
conflict
0.54
Konflikt
0.53
Activations Density 0.391%