INDEX
Explanations
adjectives related to fairness or justifiability
occurrences of the word "just" and its variations in different contexts
Negative Logits
Token        Logit
xual         -0.63
pora         -0.62
Licensed     -0.62
2020         -0.61
challeng     -0.61
ccording     -0.60
antage       -0.60
Palestin     -0.60
Archdemon    -0.58
necks        -0.58
Positive Logits
Token          Logit
ifiable        1.53
ifications     1.32
ified          1.14
ification      1.05
IFIED          0.96
ifiers         0.96
if             0.95
ifying         0.94
icia           0.90
ifier          0.88
Activations Density: 0.093%