INDEX
Explanations
words related to a reversible or opposite concept
references to various types of "verses" or poetic structures
Negative Logits
urated      -0.86
arij        -0.78
RAW         -0.72
raped       -0.68
uracy       -0.68
Kut         -0.65
URA         -0.65
ritz        -0.65
istered     -0.65
icum        -0.64
Positive Logits
verse       1.00
lihood      0.93
terday      0.85
theless     0.84
itudinal    0.70
vous        0.69
engers      0.69
vable       0.69
er          0.67
Collider    0.67
Activations Density 0.015%