INDEX
Explanations
phrases relating to the impact of human actions on health and societal issues
Negative Logits
.  -0.63
Naissance  -0.53
ويكيميديا  -0.50
izzata  -0.48
lgari  -0.42
していきます  -0.42
s  -0.42
都不是  -0.42
!  -0.41
kosi  -0.41
Positive Logits
itſelf  0.87
ConstraintMaker  0.86
بلکه  0.82
myſelf  0.79
iſt  0.71
himſelf  0.71
sondern  0.70
arşivlendi  0.69
estekak  0.69
Monfieur  0.67
Activations Density 0.281%