INDEX
Explanations
words related to qualities or attributes such as "purity" or "quality"
terms related to the concept of "identity."
New Auto-Interp
Negative Logits
accompan
-0.85
soDeliveryDate
-0.68
ellipt
-0.68
referen
-0.65
hester
-0.64
Capt
-0.64
awaru
-0.62
prototype
-0.61
inav
-0.61
emetery
-0.61
POSITIVE LOGITS
ity
0.99
Peb
0.87
fulness
0.78
ilty
0.76
istically
0.74
lihood
0.74
yip
0.73
-------
0.71
ities
0.71
rics
0.70
Activations Density 0.017%