INDEX
Explanations
adjectives related to judgment or assessment
phrases indicating perceptions of reality, contrasting beliefs, or contradictory statements
Negative Logits
istries   -0.73
partName  -0.72
cies      -0.71
imore     -0.70
tails     -0.68
ispers    -0.68
edin      -0.67
umption   -0.67
imer      -0.66
iland     -0.66
Positive Logits
ellectual                0.72
"'                       0.69
"…                       0.69
imped                    0.68
inferior                 0.67
superior                 0.67
"                        0.66
unworthy                 0.64
BuyableInstoreAndOnline  0.62
practicable              0.62
Activations Density 0.248%