INDEX
Explanations
comparisons or contrasts between different elements or concepts
New Auto-Interp
Negative Logits
OUR
-0.69
âĹ¼
-0.63
GES
-0.62
eur
-0.62
Ire
-0.61
Sharp
-0.60
ALLY
-0.57
ORY
-0.56
pez
-0.56
chemical
-0.56
POSITIVE LOGITS
pires
1.12
pired
1.10
bestos
1.08
pects
1.05
piration
1.00
phalt
1.00
semble
1.00
opposed
0.99
piring
0.96
semb
0.96
Activations Density 10.123%