INDEX
Explanations
superlatives or extremes, such as "ultimate", "new", or "perfect"
terms that denote superiority or excellence
New Auto-Interp
Negative Logits
hire
-0.85
rehend
-0.80
imester
-0.80
related
-0.78
attr
-0.76
ription
-0.76
laden
-0.76
VIEW
-0.75
cies
-0.74
onis
-0.74
POSITIVE LOGITS
antidote
0.93
arbit
0.87
embodiment
0.87
underdog
0.85
culprit
0.85
conduit
0.83
catalyst
0.83
beneficiary
0.83
gateway
0.82
Trojan
0.81
Activations Density 0.252%