INDEX
Explanations
words related to rules, accountability, and justice
questions related to societal issues or conflicts
Negative Logits
breaker      -0.76
fray         -0.74
dding        -0.73
çīĪ          -0.72
Already      -0.70
orthy        -0.70
result       -0.68
swick        -0.68
Meanwhile    -0.68
ridor        -0.68
Positive Logits
Marijuana           0.85
Artificial          0.83
vegetarian          0.82
bicycl              0.81
marijuana           0.81
atheists            0.80
graphene            0.80
diets               0.78
GMOs                0.76
cryptocurrencies    0.76
Activations Density: 1.036%